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Head start 


Europe’s mega- project to simulate the human brain has much to offer neuroscience 
research — whether or not it delivers on its central promise. 


far enough that it is now possible for us to imagine a day when 
we will understand the murky workings of our most complex 
organ: the brain. 

True, that day remains distant, but scientists are no longer consid- 
ered crazy if they report a glimpse of it on the horizon. This turning 
point has been marked by the independent launches this year of two 
major brain projects: US President Barack Obama's Brain Research 
Through Advancing Innovative Neurotechnologies (BRAIN) Initia- 
tive and the European Commission’s Human Brain Project. Even if 
they fail to achieve the ambitions the research community sets for 
them, they are signals of a new confidence. 

Right now, the two projects are not equal. The BRAIN Initiative is 
in an early phase of development, and has so far been promised little 
new money. The impetus behind it was a brash proposal by a group 
of neuroscientists for a billion-dollar project to measure the activity 
of every neuron in the human brain. That ambition was lost on the 
starting block when peers, justifiably, deemed it scientifically inap- 
propriate — but is yet to be replaced by a single goal of equivalently 
Apollo-program proportions (see page 26). This may make it hard to 
maintain the political support large projects always need. 

Conversely, the Human Brain Project — headquartered in Swit- 
zerland, where it will soon relocate from Lausanne to its new base in 
Geneva — has 135 partner institutes and is blessed with a plenitude of 
money and planning. And it has a romantic Moon-landing-level goal: 
to simulate the human brain in a computer within ten years, and provide 
it to scientists as a research resource. Programme leaders have commit- 
ted €72 million (US$97 million) to the 30-month ramp-up stage; those 
monies started to flow into labs after the project’s launch last month. The 
project has a detailed ten-year road map, laden with explicit milestones. 


f rom supercomputing to imaging, technologies have developed 


UPS AND DOWNS 
But will the Human Brain Project realize its pioneers’ greatest ambi- 
tion? Many have raised doubts, arguing that we understand too little 
about how the brain works to make the ambition feasible. However, the 
project’s scope has matured since the idea was first mooted in 2010, in 
ways some believe may increase its chances. Initially, it was conceived as 
a programme whereby bottom-up experimental data — electrophysi- 
ological, anatomical or cellular — would feed into a supercomputer 
without preconceived ideas of how the simulated neuronal circuitry 
might organize itself. A top-down element has now been introduced. 
The bottom-up data feed — mostly from research on mice — remains 
a core component, but how it is processed in the brain simulator will 


space, time and numbers, are processed in the brain. For example, in one 
major research project, around ten people will be selected for repeated 
study during the decade-long project. Their ‘reference brains’ will be 
measured using a range of non-invasive techniques such as functional 
magnetic resonance imaging and electroencephalography to work out 
how the relevant neurocircuitry is organized during specific tasks. The 
detailed bottom-up data will have to align with this broad architecture. 

Another subproject, on neurorobotics, will build or simulate robots 
to provide ‘bodies’ in which to test whether simulated brain models 

really do elicit the behaviours anticipated. 


“Major Supercomputing has proved too slow for 
breakthroughs real-time brain simulation, so other sub- 
will continue projects will focus on developing faster 
to emerge from supercomputers, as well as neuromorphic 
individual computing, which can theoretically simulate 


brain activity orders of magnitude faster than 
occurs in a real brain (see page 22). Neuroin- 
formatics, medical informatics and ethical challenges are all in there too. 

The Human Brain Project may still fail to deliver on its central 
promise, at least at the desired degree of sophistication. It remains a 
high-risk initiative, and keeping the unwieldy, multidisciplinary con- 
sortium on track may also prove difficult. But the risks are spread over 
the subprojects, some of which will inevitably add significantly to our 
sum neuroscience knowledge. 

And some will inevitably achieve the European Commission's origi- 
nal goal for the project — it is often forgotten that it was not specifically 
about the brain at all. In 2010, the commission launched a call for 
proposals for billion-euro, ten-year flagship projects that would push 
the development of information and communication technologies 
while offering a benefit to society. The Human Brain Project, with 
its relevance to brain disorders in an ageing society, was one of two 
winners. This heritage, which may turn out to be its greatest strength, 
explains its unusual interdisciplinarity. 

The project will absorb more than €1 billion in the next ten years, half 
of which will come from the European Commission and half of which 
will be raised by participants, including a large chunk from Switzerland. 
But it is much more international than this implies. Although most of 
the partner institutes are in Europe, some are further afield, in countries 
including Israel, Japan and the United States. The battle for the brain 
could be won faster with a global effort, and many will be attracted by 
the Human Brain Project's systematic approach. 

But before getting too starry-eyed about mega-projects, let's remem- 
ber that major breakthroughs in understanding the brain will continue 


investigators.” 


be guided by the findings of one of the Human 
Brain Project’s 15 subprojects, on high-level 
human cognitive architecture. This will gener- 
ate data for both animals and humans, describ- 
ing how cognitive tasks, such as those involving 


NEW ANGLES ON THE BRAIN 


A Nature special issue 
www.nature.com/neuroscience2013 


to emerge from the labs of individual investiga- 
tors. The journey towards a full understand- 
ing of the brain will be long and uncertain, and 
there will be ample opportunity for individual 
contributions to help point the way. = 
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Historic work 


Governments need to strengthen support for 
scientists who preserve our cultural heritage. 


thousands of artworks that were fashioned from rocks during the 

Neolithic period and the Bronze Age. Threatened by degrada- 
tion, such cultural heritage attracts scientists and volunteer citizens 
to ensure its preservation. 

The tools that researchers have devised to help in this task are them- 
selves creative. In one project, biogeochemists and geomorphologists 
have developed non-invasive methods that enable researchers and 
citizens to monitor and mitigate decay. Scientists interested in protect- 
ing historic collections are determining how climate change will affect 
the rates of chemical degradation of paper and silk, pest damage and 
mould growth. Then there is optical coherence tomography, which 
uses reflections of laser light to provide three-dimensional analyses of 
structures that are micrometres beneath opaque surfaces. Extensions 
in the spectral range of this technique are revealing features that are 
valuable for the conservation and historical analysis of works of art. 

Such research diversity and much more, practised in many countries, 
is devoted to the preservation and restoration of humankind’s historical 
and cultural heritage. Who could question the value of such research? 
Who would doubt the impact of studies that help to protect heritage 
tourism, a major contribution to many economies? Who could under- 
estimate the benefit to young people and their teachers in science and 
history? No one does. In fact, government ministers in various nations 
along with research-funding agencies, the European Commission, 
regional authorities and the United Nations Educational, Scientific and 
Cultural Organization all support such science in one way or another. 

And yet these heritage researchers struggle. Many have been par- 
ticularly hard hit by cutbacks in national and regional funding. But 
more important is their lack of visibility and influence for attracting 
long-term support; in that sense, the diversity of their techniques and 
subject matter undermines them. They also lack research prestige, as 
judged by the conventional standards of scientific assessment — their 


[: Ireland, parts of England and other areas of Europe there are 


work is published in low-profile journals, in museum reports and in 
other ‘grey’ literature. And so bright young researchers see too few 
career prospects, and leave the field. What a waste. 

Last week saw a notable gathering of heritage scientists at a confer- 
ence in London — notable because of the variety exemplified above, 
but also because one can speak of them as a community, and one that 
is much more coherent than in the past. This is thanks to a sequence 

of events in the United Kingdom that 


“What is most many countries would do well to emulate. 
important is A scientific committee of the House of 
to build long- Lords, the upper house of Britain’s parlia- 
lasting capacity —_ ment, recognized in 2006 that such research 
in heritage was important and neglected. As a result 


of that analysis, the national Science and 
Heritage Programme was established in 
2007, jointly funded by the Arts and Humanities Research Council 
and the Engineering and Physical Sciences Research Council. The 
programme has funded nearly 50 projects involving 200 researchers. 

Yet funding is only half the story. It is also crucial for the commu- 
nity to be coherent in its actions and in improving its visibility. Here 
progress has been made, but much more needs to be done. There has 
been too much dependence on championship of the field by highly 
committed individuals. More collective approaches are being estab- 
lished, such as the National Heritage Science Forum in Britain and 
a global forum developed by the International Centre for the Study 
of the Preservation and Restoration of Cultural Property in Rome. 
An agenda of engagement for increasing the influence of UK herit- 
age research was aptly described at last week’s conference by heritage 
scientist May Cassar of University College London (see go.nature.com/ 
bxndif). And a European Commission project initiated the Heritage 
Portal website (www.heritageportal.eu), which has the potential to 
highlight the field’s achievements. But such forums have a way to go in 
establishing themselves as forces of influence, and the Heritage Portal 
is seriously under strength. 

What is most important is to build long-lasting capacity in heritage 
research: a combination of multidisciplinary centres of excellence and, 
above all, a need for science ministers and universities to ensure that 
new permanent academic posts are established. These are essential to 
strengthen the backbone of heritage science, and so secure the future 
of our common past. m 


research.” 


Follow the crowd 


The behaviour of millions of minuscule beads 
reveals some secrets of collective motion. 


ability of crowds to make collective decisions. Flocks of starlings 

twist in unison like smoke swirls in a summer sky, and shoals of 
fish tack and veer as if in response to electric shocks. Locusts swarm 
and herds of humans can head in very unwise directions indeed. Even 
simple bacteria show collective behaviour. 

Individuals in each of these systems have very different abilities to 
communicate with each other, to actively pass on information about 
their intended actions, so why does collective behaviour across all scales 
look so similar? Is there some unknown sensation that allows the indi- 
viduals that comprise such seemingly intelligent crowds to steer; some 
distant wisdom? Although such behaviour is easy to observe, it has 
proved hard to capture in simple physical models. If we could master 
it, the information that this might yield could help engineers to develop 
swarming robots and design safer crowd-control measures. 

On page 95 of this issue, researchers in France report that they 


Test claimed wisdom is disputed, but no one should doubt the 
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have induced collective motion in millions of tiny plastic beads. The 
miniature spheres, they say, can sense the orientation of their rolling 
neighbours and adjust their own actions accordingly. In this way, the 
scientists can encourage the beads to follow the crowd, simply by pour- 
ing more of them into the system. 

The scientists — Denis Bartolo and his colleagues — squeezed a 
conducting liquid suspension of the beads into a miniature racetrack 
sandwiched between two glass plates, and watched what happened 
when they applied an electric field. An electrohydrodynamic curiosity 
called Quincke rotation causes the beads to twitch and then start to 
roll. At first, they head off at constant speed but in all directions. Then, 
as more beads are added and their number passes a critical point, the 
individuals form a crowd and their individual motion coalesces into 
coherent movement in a unified direction — just like that of a flock 
of birds. This happens, the scientists say, because the rolling spheres 
can sense the orientation of their neighbours through simple hydro- 
dynamic and electrostatic interactions. 

From plastic balls to intelligent dust: there could be interesting impli- 
cations here for work that aims to harness self-propelled and swarming 
microparticles, for example, to diagnose disease 
or improve communications. The writer Mark 
Twain said: “Whenever you find yourself on the 
side of the majority, it is time to pause and reflect” 
But sometimes, the majority really does rule. = 
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TOM RACHMAN 


CC ill these hands neer be clean?” In 
Shakespeare's play Macbeth, Lady 


WORLD VIEW .ecnssicor sen 


Cleaning damned spots 
from the obsessive mind 


Exposure therapy helps some people affected by compulsive washing, but 
often the contaminant is intangible, says Stanley Rachman. 


Macbeth helps to plot the bru- 
tal murder of King Duncan. Afterwards she 


NEW ANGLES ON THE BRAIN 


A Nature special issue 
www.nature.com/neuroscience2013 


treatment. A common technique to treat com- 
pulsive washing — and one that I helped to 
develop — aims to reduce the fear, in the expec- 
tation that this will also reduce the need to wash. 


feels tainted by Duncan's blood and insists that 
“all the perfumes of Arabia” could not sweeten her polluted hands. 
Baffled by her compulsive washing, her doctor is forced to admit: 
“This disease is beyond my practise.” 

In the 400 years since Macbeth was first performed, other doctors, 
psychiatrists, neuroscientists and clinical psychologists — myself 
included — have also found the problem beyond the reach of their 
own expertise. We see compulsive washing a lot, mostly as a symp- 
tom of obsessive-compulsive disorder (OCD), but also in people who 
have suffered a physical or emotional trauma, for example in women 
who have suffered sexual assault. The events trigger a deep-seated 
psychological, and ultimately biological, response. 

We know that the driving force of compulsive washing is a fear of 
contamination by dirt and germs. An obsessive fear of contact with 
sexual fluids, for example, can drive compulsive 
washing in OCD and force people to restrict 
sexual activity to a specific room in the house. 
Compulsive washing fails to relieve the anxiety. 
Most patients with OCD continue to feel con- 
taminated despite vigorous attempts to clean 
themselves. Why does repeated washing fail? 

There is much debate at present about the 
direction that psychiatric medicine and research 
should take. We should not underestimate what 
we can continue to learn from the careful obser- 
vation of patients. Such observations have led 
my colleagues and me to diagnose a new cause 
of OCD and other types of compulsive washing: 
mental contamination. 

Lady Macbeth could be an example of someone suffering from this 
psychological problem. Mental contamination can be evoked without 
contact with a tangible contaminant. It is a feeling of internal dirtiness 
caused by a psychological or physical violation. The source of the pol- 
lution is not an external contaminant such as blood or dirt, but human 
interaction. The affected person develops strong feelings of contami- 
nation that are evoked by direct contact with the violator or indirect 
contacts such as memories, images or reminders of the violation. 

Commonly, these patients are unwilling or unable even to speak the 
name of the violator. Milder forms of this mechanism are prevalent in 
society — in the course of a bitter divorce, for example. The emotional 
violations that can cause mental contamination include degradation, 
humiliation, painful criticism and betrayal. There is a moral element in 
most cases of mental contamination. Separate psychometric research has 
confirmed mental contamination as a coherent and measurable concept. 

This discovery has large and immediate implications for clinical 


THE SOURCE 
OF THEIR 


PROBLEMATIC 


CONTAMINATION IS 
NOT PHYSICAL BUT 


MENTAL. 


Called exposure and response prevention, it asks 
the patient to repeatedly touch the contaminating object or substance 
— rubbish, for example — and encourages them not to wash or clean 
as they then want to. The therapy aims to untangle the psychological 
‘conditioned’ association between fear and the source of the fear. 

Exposure and response prevention is a feature of cognitive behav- 
ioural therapy for OCD and other mental disorders across the world. 
Yet it is ademanding treatment. Up to one-quarter of patients drop out 
or refuse it, and up to three in ten of those who remain fail to improve. 
Over the past decade or so, it has become clear that one reason that 
these people are not helped by exposure therapy is that the source of 
their problematic contamination is not physical but mental. Feelings 
of mental contamination are diffuse, mainly internal and not acces- 
sible. In these cases, therefore, repeated hand-washing is misdirected. 
And so, crucially, is exposure therapy based on 
physical contact. 

In an early study of 50 young women who had 
experienced a sexual assault, we found abundant 
evidence of contact contamination and also 
mental contamination. After the assault they felt 
polluted, and understandably engaged in vigor- 
ous washing. However, a substantial minority of 
the women continued to feel polluted for many 
months after the event. 

Mental contamination is essentially a cogni- 
tive disorder. The patient’s memories, thoughts 
and images are the root of the problem. They 
need a cognitive approach, and my colleagues 
and I have developed an effective cognitive treatment for mental con- 
tamination. We unravel the circumstances of the violation and why the 
patients believe that they remain under threat. After working with many 
patients, we carried outa recent (non-blind) study with 12 people with 
severe OCD, most of whom had not been helped by traditional cogni- 
tive behavioural therapy (A. E. Coughtrey et al. Cogn. Behav. Prac. 20, 
221-231; 2013). Using the notion of mental contamination, and treating 
it accordingly, we managed to reduce the symptoms significantly for 
nine of them. We now plan to conduct a full-scale randomized con- 
trolled trial. If the effects of the therapy are confirmed, it would have a 
major impact. We would at last be able to treat the many patients who 
are currently, like Lady Macbeth, beyond our help. = 


Stanley Rachman is emeritus professor at the Institute of Psychiatry 
in London, UK, and the University of British Columbia, Canada, and 
author of The Fear of Contamination. 

e-mail: rachman@mail.ubc.ca 
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RESEARCH HIGHLIGHTS 


Micromotors detox 
chemical weapons 


Synthetic micromotors might 
provide a faster, better way to 
decontaminate nerve agents. 

Converting harmful 
organophosphate chemicals, 
such as sarin, into innocuous 
substances requires high 
concentrations of hydrogen 
peroxide and constant stirring 
over long periods. Such 
conditions are challenging 
when eliminating chemical- 
weapons stockpiles in remote 
or hostile locations. 

Joseph Wang at the 
University of California, 

San Diego, and his team 
overcame these requirements 
using micromotors made 

of polymers tipped with 
platinum. These zip around the 
contaminated fluid, propelled 
by bubbles that are produced as 
platinum reacts with peroxide. 
In small-volume lab tests, 

this micromixing neutralized 
nerve agents at low peroxide 
concentrations and without 
external stirring. 

Two out of three nerve 
agents tested with micromotors 
were almost completely 
detoxified within 40 minutes. 
The strategy could provide a 
general method to accelerate 
reactions, the authors say. 
Angew. Chem. Int. Ed. 
http://doi.org/f2ndgr (2013) 


Selections from the 
scientific literature 


Two routes to huge blooms 


The world’s largest flowers develop in two flowers, they found that the chambers of the 


distinct ways. 


The plants Rafflesia and Sapria are two 
groups within the same family (Rafflesia keithii 
pictured). Their gigantic flowers look similar, 
smell like rotting meat and form large chambers 
to lure carrion flies as pollinators. 

But when researchers led by Charles 
Davis of Harvard University in Cambridge, 
Massachusetts, examined gene-expression 
patterns and physical structures of developing 


Seasonal shift for 
reindeer eyes 


As the seasons change, Arctic 
reindeer (Rangifer tarandus; 
pictured) alter the colours at 
the backs of their eyes. 

In cats and some other 
mammals, a reflective surface 
called the tapetum lucidum 
reflects light through the 
retina a second time and 
improves sight in dim light. 
Glen Jeffery of University 
College London and his 
colleagues studied this 
structure in reindeer captured 
within two weeks of the 
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two plant groups come from different tissues. 


This suggests that the groups evolved their most 
characteristic features independently. 

The study might also help to explain how the 
flowers of some Rafflesia species can get as big 


as one metre across — the largest of all single 


summer and winter solstices. 
The tapetum lucideum 
changed from golden in 
summer to deep blue in 
winter. The winter colour 
seems to scatter more light 
across the retina. 
Proc. R. Soc. B 280, 20132451 
(2013) 


How bees seea 
steady landing 


Lacking stereoscopic vision, 
bees have evolved another 
technique to ensure that they 
land safely. 

Emily Baird of Lund 
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blossoms — without collapsing. The flower 
chambers are derived from the petal whorl, 
which could provide structural reinforcement. 
Proc. Natl Acad. Sci. USA http://doi.org/pq9 (2013) 


University in Sweden and her 
colleagues discovered that 
honeybees (Apis mellifera 
ligustica; pictured) alight safely 
by controlling the rate at which 
their landing zone expands in 
their field of vision. Keeping 
this ‘optic flow constant means 
that bees slow down as they 
near their targets. 

Baird’s team discovered this 
technique by studying bees as 
they landed on vertical discs 
with a rotating spiral pattern. 
Optical illusions that slowed 
down or sped up the apparent 
optic flow caused bees to 
change their flight speeds 
accordingly. Rather than 
landing, many bees would 
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either crash into the spiral or 
hover in front of it. 

This simple method of 
adjusting speed for landings 
could be widespread among 
animals and might even be 
useful in flying robots in the 
future, the authors suggest. 
Proc. Natl Acad. Sci. USA 
http://doi.org/psp (2013) 


GENETICS 


More risk genes 
for Alzheimer’s 


The largest genetics study so 
far of late-onset Alzheimer’s 
disease has identified 11 new 
genome regions that alter the 
risk of the disease. 

A team of some 200 scientists 
in the International Genomics 
of Alzheimer’s Project 
carried out meta-analyses 
of four separate genome- 
wide association studies, 
profiling 17,008 people with 
Alzheimer’s and 37,154 people 
who did not have the disease. 
The researchers reproduced 
their findings by genotyping 
an independent group of 
8,572 Alzheimer’s cases and 
11,312 controls. 

Their results confirmed 
nine out of ten known 
Alzheimer’s risk genes. Eleven 
newly identified regions 
may give more clues about 
the biology of the disease, 
for which there are still no 
effective treatments. 

Nature Genet. http://doi.org/ 
ppm (2013) 


PALAEOCLIMATOLOGY 


Deep oceanisa 
heat sink 


A long-term reconstruction 

of temperatures in the Pacific 
Ocean reveals how its heat 
content has changed since the 
last glacial era — findings that 
could augment climate models. 


Yair Rosenthal at Rutgers 
University in New Brunswick, 
New Jersey, and his colleagues 
analysed sediment cores 
in Indonesia’s Makassar 
Strait, and used the mineral 
composition of tiny sea 
creatures’ shells to calculate 
past water temperatures. 

This showed that, at 
depths of 500-900 metres, 
temperatures in the Pacific 
Ocean were typically higher 
over the past several thousand 
years than at present. Over 
millennia, the waters have 
undergone prolonged cooling 
and temperatures are now 
increasing faster than at any 
time in the past 10,000 years. 

The results support other 
studies suggesting that the 
ocean is currently taking 
up more heat, temporarily 
slowing the rise in average 
global air temperatures. 
Science 342, 617-621 (2013) 


ASTRONOMY 


Explosions in the 
young Universe 


The biggest ever thermonuclear 
blasts happened in the early 
Universe, when primordial gas 
clumps collapsed and created 
the seeds of supermassive black 
holes. 

Arising even before many 
stars and galaxies had time to 
form, the origins of big black 
holes have been a puzzle. 
Daniel Whalen of Los Alamos 
National Laboratory in New 
Mexico and his colleagues 
used computer simulations 
to reveal how gas in early 
galaxies could have cooled 
rapidly, forming stars that then 
exploded. The stars quickly 
collapsed back on themselves, 
generating turbulence that 
caused nearby gas clumps to 
form baby black holes. 

In related work, the same 
team ran other analyses 
predicting whether light 
from those supernova blasts 
would be visible to upcoming 
space missions. The James 
Webb Space Telescope, due 


RESEARCH HIGHLIGHTS MiiSaiaa¢ 


COMMUNITY 


CHOICE 


Cells grow in sleeping brains 


> HIGHLY READ 
on jneurosci.org 


Sleep seems to wake up a type of brain cell 
that maintains healthy neuron signalling. 
be : Chiara Cirelli and her colleagues at the 


University of Wisconsin-Madison analysed 
support cells called oligodendrocytes that wrap around 
neurons and protect them with a specialized sheath that helps 


their signals to travel faster. 


They collected oligodendrocytes and their precursor cells 
from mice after the animals had experienced a few hours 
of sleep or wakefulness and then profiled the expression of 
newly transcribed genes. Genes that were most active during 
wakefulness were involved in cell differentiation and death. 
By contrast, genes that were expressed during sleep promoted 
cell growth and the production of protective sheaths. 
Further experiments showed that the rate of growth of 
oligodendrocyte precursor cells doubles during sleep, and 
that cell production increased when mice spent more time in 
the stage of sleep associated with dreaming. 
J. Neurosci. 33, 14288-14300 (2013) 


for launch in 2018, should 

be able to detect these stellar 
explosions, the authors find. 
Astrophys. J. 777,99; 110 (2013) 


ENTOMOLOGY 


Male mosquito 
boosts egg-making 


Male mosquitos deliver more 
than sperm when they copulate 
with females: they also provide 
ahormone that spurs egg 
development. 

A team led by Flaminia 
Catteruccia at the Harvard 
School of Public Health 
in Boston, Massachusetts, 
studied the malaria-spreading 
mosquito Anopheles gambiae, 
focusing on a protein that is 
more abundant in females after 
mating. Tests using mosquito 
ovaries showed that this protein 
helps to direct nutrients into 
developing eggs (pictured, 


red shapes): when the protein 
was silenced, fewer eggs were 
produced (left) than when it 
was expressed (right). 
Further work showed that 
a molecule in male mating 
secretions binds to this 
female protein and boosts 
its expression, which in turn 
increases egg production. 
This is the first 
demonstration in insects 
of an interaction between a 
male hormone anda female 
protein that changes female 
reproductive success, the 
authors say. Interfering with 
it, perhaps by creating males 
that lack the hormone, might 
help to control populations of 
disease-carrying mosquitoes. 
PLoS Biol. 11,e1001695 (2013) 
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SEVEN DAYS wescnisi 


POLICY 


Chemical weapons 
Syria has destroyed all 
equipment essential for 

the production and mixing 
of chemical weapons, 

an international team of 
inspectors announced on 

31 October. The news, issued 
jointly by the United Nations 
and the Organisation for 

the Prohibition of Chemical 
Weapons (OPCW) in The 
Hague, the Netherlands, 
came one day before the 
deadline set by the OPCW. 
By 15 November, the OPCW 
Executive Council must 
approve a plan submitted by 
Syria to destroy its chemical 
weapons stockpile. 


Flu fights 

The Dutch university at which 
virologist Ron Fouchier 

and his colleagues created 
mammalian-transmissible 
strains of the H5N1 avian 
influenza virus has appealed 
against a court ruling obliging 
it to seek an export permit to 
publish such research. The 
Erasmus Medical Center in 
Rotterdam, the Netherlands, 
obtained a permit for a paper 
Fouchier published last year 
(S. Herfst et al. Science 336, 
1534-1541; 2012), but filed 

a legal challenge that sought 
to classify such work under 


NUMBER CRUNCH 


29% 


Studies that remain 
unpublished from 585 large 
clinical trials registered 

with ClinicalTrials.gov and 
completed before 2009. See 
go.nature.com/sknhdp for 
more. 

Source: C. W. Jones et al. Br. Med. J. 
347, (6104 (2013) 


New dolphin species reported 


A previously unknown species of humpback 
dolphin has been found off the coast of 
Australia. Researchers led by Martin Mendez 
of the Wildlife Conservation Society in New 
York identified the as-yet-unnamed species 
(pictured) from an analysis of 180 skulls and 
235 tissue samples of humpback dolphins from 
around the world (M. Mendez et al. Mol. Ecol. 
http://doi.org/pp6; 2013). The team proposes 


a regulatory exemption for 
basic research. The challenge 
was struck down in a court in 
September. See go.nature.com/ 
uhtxy3 and page 19 for more. 


Grant freeze ends 
Officials in Texas have 

lifted a 10-month freeze on 
grant funding at the state- 
supported Cancer Prevention 
and Research Institute of 
Texas (CPRIT) in Austin. 

A moratorium was placed 

on CPRIT after concerns 
about its grant-making 
process prompted a criminal 
investigation of the agency. See 
go.nature.com/xsloah for more. 


Marine reserves 
The Commission for the 
Conservation of Antarctic 
Marine Living Resources has 
failed to agree on a proposal 
to ban fishing in 1.25 million 
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square kilometres of the Ross 
Sea. The plan, discussed at a 
meeting in Hobart, Australia, 
that ended on 1 November, 
would have set up the world’s 
largest marine reserve in what 
some researchers say is the 
most endangered area of the 
polar region. The commission, 
made up of representatives 
from 24 countries and the 
European Union, has failed 
twice before to agree on similar 
proposals. See go.nature. 
com/133el5 for more. 


} FUNDING 
Genomic research 


Mexican billionaire Carlos 
Slim Heli, the world’s second- 
richest man, has donated 
US$74 million to the Broad 
Institute in Cambridge, 
Massachusetts, for genomic 
research. The gift from the 


that humpback dolphins in the Pacific and 
Indian oceans should be reclassified as three 
species instead of two: Sousa plumbea and Sousa 
chinensis. The humpback dolphin family — 
which has a characteristic hump just below the 
dorsal fin — includes at least one other species 
in the Atlantic Ocean. The discovery could 
affect conservation policies for the dolphins, 
which are threatened by habitat loss and fishing. 


Carlos Slim Health Institute 
will fund collaborations with 
Mexican scientists to find 
genetic variants that affect 
the risk of diseases such as 
certain cancers, type 2 diabetes 
and kidney disease in Latin 
Americans, and follows a 
previous gift of $65 million 
to the Broad made by Slim’s 
foundation in 2010. 


Brain Project move 


The European Union's Human 
Brain Project, launched last 
month, has unexpectedly 
scrapped plans to build a 
dedicated facility in Lausanne, 
Switzerland. On 29 October, 
officials at the Swiss Federal 
Institute of Technology 
Lausanne (EPFL) announced 
that the prestigious, billion- 
euro project will instead 
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move close to Geneva. The 
programme will shift into the 
former Swiss headquarters 
of the pharma giant Merck 
Serono, now leased to EPFL 
and Geneva University. 


| RESEARCH 
Dark matter 


The world’s most sensitive 
dark-matter experiment 
announced its first results 

on 30 October, reporting no 
sign of the elusive substance. 
Astronomical observations 
point to the presence of dark 
matter in space, but attempts 
to detect it directly passing 
through Earth have produced 
conflicting results. The 
findings from a 110-day search 
at the Large Underground 
Xenon experiment in Lead, 
South Dakota, did not confirm 
three earlier experiments that 
had reported hints of dark- 
matter particles. See go.nature. 
com/lhcojd for more. 


Hybrid eclipse 


A rare hybrid solar eclipse took 
place on 3 November (pictured 
in northern Uganda), as the 
path of totality moved across 
the North Atlantic Ocean and 
Africa. The relative positions of 
Earth, the Moon and the Sun 
allowed observers in Africa 

to see a total eclipse. In parts 

of the Atlantic, where Earth’s 
curvature placed its surface a 


TREND WATCH 


Global carbon dioxide emissions 
increased by just 1.1% in 2012, 
compared with an annual average 
of 2.9% since 2000, according to a 
report released on 31 October by 
the Netherlands Environmental 
Assessment Agency. After 
growing by roughly 10% a year 
for a decade, Chinese emissions 
increased by just 3% in 2012, 
owing to slower economic 
growth and more low-carbon 
energy. Emissions from the 
United States and the European 
Union fell by 4% and 1.6%, 
respectively (see chart). 


little farther from the Moon, 
observers saw an annular, or 
‘ring of fire’ eclipse, with the 
Moon only partially blocking 
the Sun. This type of eclipse has 
not occurred since 1854 and 
will not recur until 2172. 


Mars launch 

India has launched a 
spacecraft to Mars. The Mars 
Orbiter Mission (informally 
called Mangalyaan) lifted 

off on 5 November, aiming 
to reach orbit around the 

red planet in September 
2014. The 4.5-billion-rupee 
(US$73-million) mission is the 
country’s first interplanetary 
probe. 


| _BUSINESS 
False drug claims 


US regulators announced 

on 4 November that Janssen 
Pharmaceuticals has pleaded 
guilty to making false claims 
about its schizophrenia drug 
Risperdal (risperidone). 
Janssen, based in Titusville, 
New Jersey, has agreed to pay 
more than US$1.6 billion. 


The US Food and Drug 
Administration approved 
Risperdal for schizophrenia 
and acute mania, but Janssen 
also marketed the drug 

for some elderly patients 

with dementia, despite the 
increased risk of stroke in 
older patients taking the drug. 


Prenatal testing 
AUS court has invalidated a 
patent at the heart ofa territory 
war in the booming industry 
of non-invasive prenatal 
testing. On 30 October, a 
California federal judge issued 
the ruling on a patent licensed 
to Sequenom of San Diego. 
Sequenom had attempted to 
invoke its patent in order to 
block other companies from 
selling their own non-invasive 
prenatal tests — a move that 
could have led to an industry 
monopoly (see Nature 486, 
454; 2012). Sequenom says that 
it will appeal against the ruling. 


Alcohol agency 


Neurobiologist George Koob 
of the Scripps Research 
Institute in La Jolla, California, 
has been appointed to head 
the US National Institute on 
Alcohol Abuse and Alcoholism 
(NIAAA). In January 2014, he 
will replace interim director 
Kenneth Warren, whose five- 
year stint saw the National 
Institutes of Health consider, 
and later scrap, a plan to merge 
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SEVEN DAYS | THIS WEEK | 


9-13 NOVEMBER 
The Society for 
Neuroscience meets in 
San Diego, California. 
Major brain initiatives 
in the United States and 
Europe are among the 
highlighted topics. 
go.nature.com/eubkfx 


12 NOVEMBER 

The International 
Energy Agency releases 
its World Energy 
Outlook 2013, which 
looks at oil resources 
and demand, and energy 
efficiency to 2035. 
go.nature.com/7j3jw1 


the NIAAA and the National 
Institute on Drug Abuse (see 
Nature 467, 643; 2010). 


Maddox Prize 


Pharmacologist David Nutt 
was awarded the John Maddox 
Prize for Standing up for 
Science on 4 November. The 
prize honours people who, in 
spite of challenges, promote 
science in the public interest. 
Based at Imperial College 
London, Nutt was fired 

from his post as the United 
Kingdom's chief drugs adviser 
in 2009, after criticizing the 
government's drugs policy. 
The award is named after a 
former editor of Nature, and it 
is jointly awarded by Nature, 
the Kohn Foundation in 
London and the UK campaign 
group Sense About Science. 


CLARIFICATION 

The item ‘Data dash’ (Nature 
502, 598; 2013) may have 
implied that NASA's rapid 
data transmission was the 
result of faster laser beams. 
In fact, the record was set 
thanks to greater bandwidth. 


> NATURE.COM 
For daily news updates see: 
Www.nature,com/news 
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Athenat+, a planned X-ray observatory that would be the most powerful ever flown, is likely to be launched in 2028. 


2 


X-rays top space agenda 


European agency selects mission themes, with X-ray telescope the biggest winner. 


BY ELIZABETH GIBNEY 


n X-ray space telescope has taken pole 
A= sition ahead of a gravitational-wave 

observatory in the race to be Europe’s 
next major space-science mission. 

On 31 October, European Space Agency 
(ESA) director of science and robotic explo- 
ration, Alvaro Giménez, put forward the “hot 
and energetic Universe” and the “gravitational 
Universe” as the two themes that will shape 
the agency’s next large (L-class) projects — 
the €1-billion (US$1.4-billion) L2 and L3 
missions, scheduled for launch in 2028 and 
2034, respectively. ESAs Science Programme 
Committee, composed of member-state rep- 
resentatives, will consider the recommenda- 
tion and announce a decision at a meeting 
on 28-29 November. It would be rare for the 
committee not to endorse the science direc- 
tor’s advice. 

The first of the themes is good news for 
Athena+ (the Advanced Telescope for High 
Energy Astrophysics), an X-ray observatory 
that would be the front runner in a formal 
call — planned for next year — for mission 


proposals to probe the hot and energetic 
aspects of the Universe. The observatory, which 
willcarrya telescope and instruments to meas- 
ure the energy of X-ray sources and image them 
with unprecedented precision, would study 
how hot gas evolves into galaxy clusters and 
how black holes grow. Luigi Piro, a member of 
the Athena+ instrument and optics working 
group at Italy’s National Institute for Astrophys- 
ics in Rome, says that half of all visible matter 
is in this ‘hot phase, but is poorly understood. 
“We will now be able to tackle questions about 
how the Universe is actually working and what 
is the role of hot plasma and black holes in 
shaping the Universe,’ he explains. 

The second theme would focus on gravita- 
tional-wave astronomy, a field never before 
studied in space. The planned space-based 
mission eLISA (evolved Laser Interfero- 
meter Space Antenna) would detect the low- 
frequency gravitational waves that are thought 
to stretch the fabric of space-time. Gravita- 
tional waves have not been directly detected 
at ground-based observatories. eLISA would 
bounce lasers between three spacecraft 
at least one million kilometres apart, and 


spot a passing wave when it alters the precise 
positioning of one of the spacecraft. 

eLISA would detect the signals from 
colliding supermassive black holes and be able 
to trace the evolution of galaxies, says Karsten 
Danzmann, director of the Max Planck Insti- 
tute for Gravitational Physics in Hannover, 
Germany. Because gravitational ripples can 
pass undisturbed through matter that affects 
electromagnetic waves, eLISA will enable 
scientists to ‘see’ further back in time, to the 
earliest days of the Universe. It is also hoped 
that eLISA will tackle questions in fundamen- 
tal physics, such as whether general relativity 
holds at the edges of black holes. 

For proponents of eLISA, a launch date 
of 2034 is frustrating, because a pathfinder 
mission to test the necessary technology is 
set to fly in 2015 after several years of delays. 
Danzmann says that the proposal that the 
X-ray observatory should be the first to launch 
reflects a desire to concentrate on the “slightly 
less risky” of the two projects. 

But Randall Smith, an X-ray astronomer at 
the Harvard-Smithsonian Center for Astro- 
physics in Cambridge, Massachusetts, 
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> admits that he was surprised that his 
field was picked ahead of gravitational-wave 
astronomy, given that in previous competi- 
tions, including the US Decadal Survey, which 
decides future NASA missions, the latter had 
been given the edge. 

Many astrophysicists think that the ideal 
situation would be to go for both projects in 
parallel, because the two observatories would 
be looking at some of the same targets, such as 
black holes, albeit through very different eyes. 
Bringing the gravitational mission forward 
would require a cash injection. But Danzmann 
says that China has already expressed an inter- 
est in getting involved with a space-based 
gravitational-wave detector. And NASA, which 
pulled out ofa similar joint mission in 2011, 
might be able to reconsider contributing after 
the launch of the James Webb Space Telescope 
in 2018, he suggests. 

ESAs science programme schedules three 
missions of this scale every 20 years (see “X-ray 


X-RAY SPECS 


The European Space Agency is very close to finalizing the following line-up of large (L-class) missions. 


The magnetic field of Jupiter’s moon Ganymede 


Black holes and galaxy clusters. 


Mission Name Launch date | Study targets 
(Lal JUICE 2022 
2 Athena+ 2028 
L3 eLISA 2034 


specs’). In May last year, the Jupiter Icy Moons 
Explorer (JUICE) was approved as the first of 
three missions under ESA’s Cosmic Vision pro- 
gramme, intended for the L1 launch opportu- 
nity in 2022. At that time, previous incarnations 
of Athena+ and eLISA missed out on selection. 
The L2 and L3 science themes were based on 
recommendations from ESA’ independent 
Senior Survey Committee, chaired by astrono- 
mer Catherine Cesarsky, a former director- 
general of the European Southern Observatory. 

The current two winners were selected 
from 32 proposals presented at a meeting in 


Gravitational ripples from black-hole mergers. 


September. Other options considered were a 
y-ray telescope, a mission to explore interstel- 
lar space and a host of proposals to explore 
moons and planets in the Solar System. 

Although disappointed that no plan- 
etary missions have been put forward, Colin 
Wilson, a planetary physicist at the University 
of Oxford, UK, says that his field will still be 
busy in the coming decade, through planned 
ESA missions to Mercury, Mars and Jupiter. 

But, he adds, exploring the farthest realms of 
the Solar System, beyond Saturn, will be a real 
challenge in the coming decades. m 


Precision gene editing paves 
way for transgenic monkeys 


Despite political challenges, engineered primates could be better disease models than mice. 


BY HELEN SHEN 


nthony Chan spent two years creating 
A: first five monkeys in the world to 
be genetically engineered with human 
mutations — in this case, for Huntington's dis- 
ease. But three of the five monkeys, reported 
in 2008, developed severe symptoms of Hun- 
tington’s much more quickly than anticipated, 
and had to be killed within a month of birth’. 
Viruses used to introduce the relevant gene had 
inserted extra copies randomly, intensifying the 
symptoms — and highlighting the method's 
limitations in creating animal disease models. 
Chan, a geneticist at Emory University in 
Atlanta, Georgia, and other scientists around 
the world are now eyeing precision genome- 
editing techniques that solve such problems 
by using enzymes and RNA instead of viruses. 
Many have high hopes that transgenic mon- 
keys will mimic human genetic conditions 
more faithfully than mice — and thus permit 
better drug-development tests. Many also say 


NEW ANGLES ON THE BRAIN 


A Nature special issue 
www.nature.com/neuroscience2013 


Marmosets are among the primates that may soon be engineered with custom genetic-editing methods. 


that the primates will accelerate basic research 
in neuroscience, permitting researchers to map 
and test complex neural circuits underlying 
behaviours that do not exist in simpler organ- 
isms. “We weren't even able to think about this 
before,’ says Chan. 

Work is already under way in a few labs. 
Erika Sasaki, a geneticist at Keio University in 
Tokyo, was part of the team that used viruses 
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to produce the first transgenic marmoset to 
pass on an engineered gene to its offspring’. 
On 11 November, at the annual meeting of the 
Society for Neuroscience in San Diego, Cali- 
fornia, Sasaki and her colleagues will present 
efforts to model autistic behaviour in mar- 
mosets using mutations produced with zinc- 
finger nucleases — a class of DNA-cutting 
enzyme. Next month, the Salk Institute for 
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CRISPR CUTS ~ 2 The RNA sequence, linked 
Scientists have developed a to a DNA-cutting enzyme, 
precise way to disable genes. X finds the right spot on the 
genome. 
1 | Engineer a section of DNA-cutting 
RNA to target a specific enzyme 


region of DNA. 


Guide sequence 


4 | While the DNA is broken, it 
is vulnerable to strategic 
insertions of small amounts 
of engineered DNA that 
could alter the gene. 


Biological Studies in La Jolla, California, is 
hosting a workshop to bring together molecu- 
lar biologists, bioengineers and neuroscien- 
tists to work on challenges in the field. “This 
is something with tremendous potential,” says 
Terrence Sejnowski, head of the computational 
neurobiology laboratory at the Salk Institute. 

Neuroscientists have long wanted transgenic 
monkeys. Disorders such as autism, schizo- 
phrenia and Alzheimer’s disease cannot be 
fully replicated in mice, which lack the complex 
cognitive and social abilities of primates. More- 
over, many neuroactive drugs that showed early 
promise in mice have failed in human trials. 

But researchers have persisted in working 
with mice, mainly because there is a targeted 
gene-editing strategy that works in the ani- 
mals. It relies on extremely rare, spontaneous 
DNA-swapping events to alter or disable cer- 
tain genes. The method requires several ardu- 
ous steps: designing the desired mutation and 
introducing it to embryonic stem cells; detect- 
ing it in the tiny fraction of stem cells that 
incorporate it; integrating those stem cells into 
a developing mouse embryo; and then hoping 
that when the ‘patchwork animal matures, its 
sperm or eggs will carry the mutation, allowing 
the creation of transgenic offspring. 

The approach is feasible only because mouse 
stem cells are relatively cheap to maintain and 
screen, and because mice reach sexual matu- 
rity within weeks and produce large litters. 


> 
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“You don't have that luxury with monkeys,’ 
says Edward Callaway, a neuroscientist at the 
Salk Institute. Marmosets, for example, take 
15 months to mature sexually, carry babies for 
5 months and often give birth to only 2 young. 
Macaques take even longer — 3 years to reach 
sexual maturity, and they typically produce a 
single baby after a 5.5-month gestation. 

But custom monkeys have become a possibil- 
ity thanks to efficient gene-editing techniques 
that can support a method in which embryos 
are manipulated one at a time. In one method, 
zinc-finger nucleases home in on specific 
genomic regions, and cut genes open to disrupt 
their function or allow researchers to substitute 
in external DNA. In another, called CRISPR, a 
customizable RNA fragment is used to guide 
a DNA-cutting enzyme to the right spot (see 
‘CRISPR cuts’). Feng Zhang, a synthetic biolo- 
gist at the Massachusetts Institute of Technol- 
ogy (MIT) in Cambridge, showed in May that 
CRISPR could be used to make precise muta- 
tions in multiple genes in mouse embryos’. 
That, he says, could pave the way for monkey 
models of human brain disorders that involve 
more than one gene. 

“We now have, for the first time, the pos- 
sibility of trying some of the treatments for 
genetic-based diseases in primates,” says 
Robert Desimone, director of the McGovern 
Institute for Brain Research at MIT. 

MIT researchers are now testing CRISPR on 
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fertilized monkey eggs, in collaboration with 
researchers from the Oregon National Primate 
Research Center in Beaverton. They will start by 
disabling genes. MIT neuroscientist Guoping 
Feng, one of Zhang’s collaborators, wants to 
disrupt a gene called SHANK3, which has been 
implicated in some human cases of autism. 
More sophisticated changes, such as swapping 
ina different version ofa gene, will require more 
work and engineering, says Zhang. 

Eventually, he adds, CRISPR might be used to 
label specific monkey neuron types or control 
them with light — just as is currently possible 
with mouse neurons. Basic neuroscientists are 
especially keen to see those applications suc- 
ceed. Anthony Movshon, a monkey-vision 
researcher at New York University, has watched 
over the past decade as scientists have used 
genetic tools to stimulate, inhibit and record 
neural activity in mice and flies. But for many 
important areas of neuroscience — cognition, 
attention, memory and decision-making — 
those animals are “not that interesting’, he says. 

All of this excitement comes at a time when 
monkey research has become increasingly dif- 
ficult. In January, following years of pressure by 

animal-rights activ- 


“We now have, ists, United Airlines 
for the first time, announced that it 
the possibility of — wouldstop transport- 
trying some of ing research monkeys 
the treatments — eliminating the 
for genetic- last North American 
based diseases air carrier available 


to primate research- 
ers (see Nature 483, 
381-382; 2012). And in April, the New Eng- 
land Primate Research Center in Southborough, 
Massachusetts — one of eight such centres sup- 
ported by the US National Institutes of Health 
— announced that it would close after gradually 
transferring its animals to other facilities. 

Despite the controversies, some research- 
ers say that using transgenic monkeys may be 
the best way to search for treatments for brain 
diseases and to investigate how networks of 
neurons create consciousness. “You want to 
match the model system to the question you're 
asking,’ says Movshon. “It’s just as unethical to 
use a mouse for an experiment for which it’s 
not suited.” m 


in primates.” 


1. Yang, S.-H. et al. Nature 453, 921-924 (2008). 
2. Sasaki, E. et al. Nature 459, 523-527 (2009). 
3. Wang, H. et al. Cell 153, 910-918 (2013). 
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Scientists tested rugged new equipment for the Transportable Array near Toolik Lake, Alaska, in 2011. 


US seismic array eyes 
its final frontier 


Moveable sensor grid will begin monitoring Alaska next summer. 


BY ALEXANDRA WITZE 


n Maine’s rugged coast, just north of 
() the tourist town of Boothbay, an under- 

ground seismometer is listening for 
earthquakes. Engineers activated it on 26 Sep- 
tember, completing the US$90-million Trans- 
portable Array, an ambitious effort to blanket 
the contiguous United States with a moveable 
grid of seismic monitors (see ‘On the marclv). 

Since 2004, the set of 400 seismometers, 
loaded on trucks, has marched gradually east- 
wards across the continent, from the Pacific 
coast across the Rocky Mountains and the 
Great Plains to reach the eastern seaboard. At 
each spot, technicians dig holes to bury the 
instruments in plastic tanks underground. 
The process has drawn the best picture yet of 
the North American part of Earth’s mantle, 
reaching hundreds of kilometres beneath the 
surface. The array has illuminated how slow- 
motion earthquakes shimmy along the coast 
of the Pacific Northwest, and how molten rock 
rises in the hot spot deep beneath Yellowstone 
National Park in Wyoming. 

“As the array has moved, the whole picture 
of what’s under North America has gotten 
much sharper,’ says Andy Frassetto, a seismol- 
ogist at the Incorporated Research Institutions 


for Seismology (IRIS) in Washington DC, 
which operates the stations. 

Now, the Transportable Array’s operators 
are looking to the far north, in what may be its 
toughest challenge yet. Having almost finished 
the job in the lower 48 states, the seismometers 
will start to be relocated next spring to Alaska 
— by far the most seismically active US state, 
and not thoroughly monitored yet. The project's 
seasoned engineers will have to fly many of the 
instruments to remote locations by helicopter, 
then drill into frozen ground and install the 
seismometers with battery packs to keep them 
working through the long northern winter. (A 
few additional stations will be installed across 
the Canadian border, in the Yukon Territory.) 

The Transportable Array, along with other 
permanent and temporary seismic stations, 
is one of three cornerstones making up the 
larger EarthScope initiative. EarthScope was 
conceived as a way to combine different geo- 
physical views of the deep Earth to provide 
data on a grand scale for researchers working 
across North America on all aspects of geosci- 
ence. EarthScope’s second component comes in 
the form of Global Positioning System instru- 
ments that detect tiny changes in ground move- 
ment, such as those that occur along geological 
faults. The initiative’s third component was a 
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3.2-kilometre-deep hole drilled into Califor- 
nia’s San Andreas fault, although the effort was 
marred when instruments lowered down the 
hole stopped working after just days (see Nature 
459, 20-21; 2009). 

But the first two parts of EarthScope are 
yielding plenty of insights. “We've learned a 
lot more by integrating things together than 
we would have by doing them separately,’ says 
Robert Smith, a geophysicist at the University 
of Utah in Salt Lake City, and an early leader 
of EarthScope. 

When the Transportable Array was initiated 
in 2004, with funding from the US National 
Science Foundation, it was just a single station 
outside San Diego, California. Now, its 400 sta- 
tions are arranged at 70-kilometre intervals in 
a regular grid. Each stays in the ground for 
roughly two years before being dug up and 
redeployed to the array’s leading edge. The sta- 
tions capture seismic waves travelling through 
the ground from earthquakes near and far — 
information that geoscientists can use to probe 
deeper into Earth, and with greater regularity, 
than with other, smaller arrays. 

Among other things, the EarthScope seis- 
mic data have allowed researchers to construct 
images of the deep Earth, rather like a geologi- 
cal computerized-tomography scan (Y. Yang 
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Since 2004, 400 moveable seismometers have measured 


tremors in different parts of the United States. 


ALASKA 


J. Geophys. Res. 113, B12310; 2008), and to trace 
shaking from great earthquakes, such as Japan’s 
Tohoku quake in March 2011, to calculate howa 
fault ruptured half a world away (M. Ishii Earth 
Planets Space 63, 609-614; 2011). 

Some seismologists worried that things 
might get boring as the array trundled across 
the United States, leaving the seismically excit- 
ing west for the flatter east. But that did not hap- 
pen; in fact, the latest data show that the upper 
part of Earth’s mantle east of the Rocky Moun- 
tains is just as complex as that in the west, says 
Suzan van der Lee, a geophysicist at Northwest- 
ern University in Evanston, Illinois, who pre- 
sented the work on 30 October at the annual 
meeting of the Geological Society of America in 
Denver, Colorado. The array also captured the 
surprising magnitude-5.8 earthquake in Vir- 


giniain August 2011, . 

and the seismic shak- There willbe 

ing caused by Hurri- sSelsmometers 

cane Sandyasit swept inthe parts of 

the eastern seaboard the state that 

in October 2012. have never had 
Array deployments them.” 


went so well that the 

project had money remaining and was able 
to add 60 previously unplanned Canadian 
stations in southern Ontario and Quebec, to 
study a zone of seismicity along the St Law- 
rence seaway. Other dreams didn't transpire; 
some scientists wanted to ramp up from tak- 
ing 40 observations per second to 100. Such 
a change would have overloaded the data- 
storage and delivery systems, says the array’s 
manager, Robert Busby of IRIS. 

Now, as the array prepares to leave the 
east coast, project managers are tackling the 
logistics of the five-year, $40-million move 
to Alaska as soon the snow melts there next 
spring. About 160 of the 400 stations currently 
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in the east will remain behind, as part ofa plan 
drawn up by four federal agencies to fill in 
seismic monitoring gaps in heavily populated 
areas and near nuclear power plants. 

EarthScope managers will use the money 
saved to buy more durable equipment for 
Alaska, where the conditions are notoriously 
tough on outdoor gear. Busby’s team is acquir- 
ing high-capacity lithium-ion batteries that 
can be charged by solar panels in the sum- 
mer and survive all winter. Engineers are also 
developing methods to relay data by satellite 
phone, because the mobile-phone network in 
Alaska covers only a fraction of the state. 

And instead of burying seismometers in plas- 
tic containers, which could breakas the ground 
thaws and freezes, the team plans to install them 
deep in permanently frozen ground or solid 
rock. They have developed a lightweight drill rig 
that can be carried easily by helicopter, because 
many of the seismic sites have no road access. 

Logistical problems are one reason that the 
stations will be spaced less densely in Alaska 
than in the 48 contiguous states — every 
85 kilometres instead of every 70. Even so, 
they will provide a sharper and deeper view of 
what is happening beneath Alaska than ever 
before. “It'll make such a difference in so many 
ways, says Peter Haeussler, a geologist with the 
US Geological Survey (USGS) in Anchorage, 
Alaska. “There will be seismometers in the 
parts of the state that have never had them” 

Alaska also has some of the most complex 
geology in North America, with the Pacific crus- 
tal plate slamming into and diving under the 
continent. Yet little has been done to probe the 
underlying crust and mantle, mainly because 
instrumenting the entire state is so expensive. 
“We have sort of a ‘zeroth’ order of understand- 
ing,” says Rick Saltus, a USGS geophysicist in 
Denver. Now, he says, “we'll get the first order” m 
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IN FOCUS 


The fetus used by Carl Linnaeus as the type specimen of the Asian elephant. 


PROTEOMICS 


Proteins help solve 
taxonomy riddle 


Proteomic technique proves that 300-year-old Linnaean 
elephant was wrongly classified. 


BY EWEN CALLAWAY 


300-year-old pickled fetus that was 
Ae by Carl Linnaeus as the type 
specimen of the Asian elephant has 
been proved to be an African elephant — by 
the use of the emerging technology of ancient 


proteomics. 
Evolutionary geneticist Tom Gilbert, whose 


University of Copenhagen lab owns one of 
the world’s most advanced DNA sequencing 
machines, had failed to identify the species 
from a DNA sample seven years ago because it 
was too degraded. To sidestep this problem, he 
and his colleague Enrico Cappellini decided to 
run an advanced protein analysis using a sam- 
ple from the oesophagus of the fetus, held in 
the National Museum of Natural History in 
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Stockholm. “Clearly it’s got protein, because 
the damn thing still exists,” Gilbert says. 

The team sequenced several proteins in the 
specimen that were known to vary between 
elephant species, and each pointed to an Afri- 
can origin for the fetus (see go.nature.com/ 
ivzyzm). Linnaeus assigned the elephant fetus 
as the type specimen of the Asian elephant 
(Elephas maximus), but zoologists had long 
suspected that it was in fact an African ele- 
phant (now called Loxodonta africana). 

The work demonstrates the power of 
studying ancient proteins. Proteins are more 
resistant to degradation than DNA and could 
push the molecular fossil record back millions 
of years, as well as revealing biological insights 
that cannot be gleaned from genetic material 
alone, say scientists who study such remains. 

Ancient protein analysis is not a new pur- 
suit, says Matthew Collins, a biogeochemist 
at the University of York, UK. As long ago as 
1954, amino acids were detected in trilobite 
and dinosaur fossils. 

But the field took off in 2000, when Peggy 
Ostrom, a geochemist at Michigan State 
University in East Lansing, reported protein 
sequences from ancient bison and walrus 
bones up to 50,000 years old’. Her lab was 
the first to analyse ancient proteins using the 
technique of mass spectrometry, which ion- 
izes peptide fragments and then measures their 
mass, enabling identification when compared 
with reference databases. Mass spectrometry 
revolutionized the field of proteomics, in which 
hundreds of proteins from a tissue are analysed 
at once, and made it possible to sequence 
fragmented ancient proteins. 

Early efforts by Ostrom, Collins and oth- 
ers focused on individual proteins that are 
abundant in bone remains, such as collagen. 
“We call it the barcode of death,” says Collins, 
who uses collagen sequencing to quickly and 
cheaply identify species found at archaeo- 
logical sites, such as the animals used to make 
parchment or the horns on Viking helmets. 
Collagen is also remarkably stable: it has been 
sequenced from a 3.5-million-year-old fossil 
of a giant camel from the Arctic’. 

But collagen differs very little between closely 
related animal species, making it useless as a 
marker for evolutionary change. “You cannot 
tell an ibex from a domestic goat; you cannot tell 
a human from a Neanderthal,” Collins says. So 
Cappellini began developing methods to 
identify large numbers of different proteins 
in long-dead organisms. 
In 2012, his team identi- 
fied 126 proteins froma 
43,000-year-old woolly 
mammoth femur’. Ear- 
lier this year they se- 
quenced 73 proteins from 
a 750,000-year-old horse 
fossil*. Both specimens 
also yielded DNA, but the 
protein data could reveal 


For the full story on 
Linnaeus’s elephant, 
see: 
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the additional information about which genes 
were expressed in bone tissue. 

Like DNA sequencing, however, mass spec- 
trometry of an ancient sample can yield an 
overwhelming amount of data, says Ostrom. 
“Tt can be daunting, you can get hundreds of 
sequences from just a single sample of a single 
protein, from one individual, she says. And, 
like ancient DNA, ancient proteins suffer from 
contamination and chemical modification, 
although less is known about those processes 
with proteins, Ostrom says. Another drawback 
is that proteomics relies on databases of protein 


sequences to identify the proteins found. Such 
databases are lacking for most modern animals, 
let alone their ancient ancestors. 

Researchers studying ancient proteins hope 
to do more than just sort out museum collec- 
tions, although Gilbert and Cappellini are work- 
ing on plenty of these. Ancient proteins can 
indicate which genes were active in particular 
tissues from old specimens. They can also reveal 
phylogenetic information from specimens in 
which the DNA is too degraded to analyse, such 
as ancient samples from warm climates, where 
DNA doesn't last long, says Collins. 
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For instance, the evolutionary relationship 
between humans and Homo floresiensis, a 
short-statured human fossil from the Indone- 
sian island of Flores, is unclear, and efforts to 
extract DNA have so far failed. Ancient pro- 
teins might finally place this ‘hobbit’ correctly 
on the human family tree. m 
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Pathogen-research laws queried 


Scientists fear EU biosafety rules could complicate publication of work on infectious diseases. 


BY DECLAN BUTLER 


eading virologists have written to the 


president of the European Commission 

to urge him to clarify how laws designed 
to curb the proliferation of biological weapons 
apply to the publication of research on dan- 
gerous pathogens. The move by the European 
Society for Virology (ESV) comes after a Dutch 
court in September upheld a government order 
that scientists who engineered forms of H5N1 
avian influenza to make them transmissible 
between mammals needed to seek an export 
permit before publishing such work. 

The ESV’s five-page letter to José Manuel Bar- 
roso, dated 16 October, warns that the court rul- 
ing sets an unwelcome precedent. H5N1 is just 
one of more than 100 dangerous human, animal 
and plant pathogens and toxins that fall under 
European Union (EU) export-control legisla- 
tion from 2009. This means, say the virologists, 
that any EU scientist who works on one of the 
listed pathogens could be forced to apply for an 
export permit before publishing their research. 

They write that to better inform courts and 
policy-makers on scientific issues related to 
biosecurity laws, the European Commission 
should consider creating an equivalent of 
the US National Science Advisory Board for 
Biosecurity — an independent committee in 
Bethesda, Maryland, that advises on issues of 
biosecurity and dual-use research (findings 
that could be adapted for harmful purposes). 

The ESV also backs the case of Ron Fouch- 
ier, a virologist at the Erasmus Medical Center 
(EMC) in Rotterdam, the Netherlands, whose 
team engineered the H5N1 strains. It was 
Fouchier’s attempt to publish this work that led 
to the export-licence order. He has been fight- 
ing it— and on 31 October, the EMC contested 
the ruling in the Amsterdam Court of Appeal. 


The origin of the ESV’s letter dates back to 
2011, when Fouchier tried to publish a paper in 
Science that describes how he engineered H5N1 
strains that could be transmitted through the air 
between ferrets. This prompted a global outcry 
over the biosecurity of such ‘gain-of-function’ 
research — experiments designed to increase 
the transmissibility or pathogenicity of organ- 
isms (see nature.com/mutantflu). The Dutch 
government subsequently invoked European 
export-control legisla- 


tion to oblige Fouch- “Tf the benef: its 
ier and the EMC to outweigh the 
obtain anexport per- risks, the export 
mit before publishing licence will be 
their work. Fouchier granted.” 

initially threatened to 


defy the government, but conceded and applied 
for a permit under protest in April last year. The 
government granted it a few days later. 

Fouchier has argued that the EMC should 
not have needed a permit because the work 
fell under an exemption of EU export-control 
laws for ‘basic scientific research. He also said 
that the methods described had already been 
published. But these arguments were rejected 
by the court, which added in its ruling that the 
need to prevent the proliferation of biological 
weapons outweighed any delays in publication 
caused by government reviews of such papers. 

Giorgio Palt, a virologist at the University of 
Padua in Italy and president of the ESV, finds it 
particularly unsettling that the court also ruled 
that it was not up to Fouchier himself to decide 
whether to apply for an export permit. Palu 
thinks this could mean that researchers work- 
ing on any of the listed dangerous pathogens 
in the Netherlands, and perhaps in the whole 
of the EU, might automatically have to request 
export permits if they intend to publish papers 
on them. 


But Thijs van Son, a spokesman for the Dutch 
foreign-affairs ministry, says that this is not the 
case. He adds that according to Dutch govern- 
ment policy, the decision on whether to apply 
for an export permit still lies with the researcher. 

“If the researcher is convinced that the 
export-control legislation is not applicable to 
his research, he can decide not to apply for an 
export licence,” says van Son. “However, if he is 
mistaken, he risks committing an offence and 
receiving a fine or imprisonment.” Authori- 
ties, he adds, are happy to advise researchers 
on whether they need a permit. 

Van Son also says that the ESV letter is cor- 
rect that the ruling could lead to an increase 
in the number of papers that Dutch authori- 
ties are asked to review. But he says that the 
export-control law is an essential component 
of international efforts to halt the proliferation 
of chemical and biological weapons. 

“The licence requirement enables the export- 
control authorities to assess the proliferation 
and security risks of, for example, gain-of-func- 
tion research against the public-health benefits 
and the freedom to share scientific informa- 
tion, he says. “If the benefits outweigh the risks, 
the export licence will be granted. This was the 
case with the application of Professor Fouchier.” 

The Dutch H5NI1 case is an unprecedented 
test of the 2009 EU export laws, and it is not 
clear how other member states would handle 
a similar case. Ben Berkhout, a virologist at 
the University of Amsterdam and a member 
of the ESV’s executive board, says that the rul- 
ing has caused uncertainty as to what it cov- 
ers. “Most work on these pathogens is totally 
harmless,” he adds. He fears that the ruling 
will lead to added bureaucracy, and feels that 
it would be much better for governments 
to focus on the experiments that carry the 
greatest risks. m 


7 NOVEMBER 2013 | VOL 503 | NATURE | 19 


© 2013 Macmillan Publishers Limited. All rights reserved 


JULIEN TROMEUR/SHUTTERSTOCK 


NEW ANGLES ON THE BRAIN 


Technologies that probe neural circuitry could greatly 
advance the study of human cognition. 


science, at least in terms of funding, 

if not yet understanding. This year, 
the European Union and United States 
announced separate long-term programmes 
to study the human brain that, together, could 
pour more than US$2 billion into neurosci- 
ence during the next decade (that’s more than 
2 cents for every neuron in the body’s most 
complex organ). Driving these investments is 
the sense that researchers are on the verge of 
making leaps in understanding the brain — 
advances that could eventually lead to better 
treatments for mental disorders. 

In this special issue, Nature taps into this 
excitement with reporting and opinions on 
efforts to apply current technologies and 
invent new ones to grasp how all the grey and 
white matter in the brain actually works. 

Take the design of ‘neuromorphic hardware 
— computer systems that function according 
to similar principles as neurons and neural cir- 
cuits in the brain. After years of development, 
applications in robotics, mobile electronics and 
neuroprostheses are finally in sight (see page 
22). On the biological side, researchers are 
applying a broad array of techniques to map- 
ping neural connections in the brain (see page 
147). On page 31, neurologist Kenneth Kosik 


W e are entering the golden age of brain 


argues that technologies that can monitor 
thousands or millions of networked neu- 
rons are exactly what are needed to better 
understand and treat Alzheimer’s disease. 
Meanwhile, researchers with neuroimaging 
expertise are increasingly in demand, thanks 
to hopes that the technology will reveal infor- 
mation about everything from the progression 
of brain diseases to behaviour (see page 153). 

Amid the excitement, there are also sig- 
nificant questions about the major brain 
projects (see page 5). In the case of the US 
initiative, neuroscientists are not sure what 
kind of research it will support, and many in 
the field have spent much of the year strug- 
gling to define the scope of the project (see 
page 26). However, everyone agrees that the 
new opportunities presented by technology 
warrant exploitation. This message comes 
through especially vividly on page 29, where 
an anonymous neuroscientist with Parkinson's 
disease recounts his poignant experience of 
studying the brain while watching his own 
begin to fail. = 
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Computer chips inspired by human neurons can do more with less power. 


wabena Boahen got his first computer 
in 1982, when he was a teenager living 
in Accra. “It was a really cool device,” he 
recalls. He just had to connect up a cas- 
sette player for storage and a television set fora 
monitor, and he could start writing programs. 
But Boahen wasn’t so impressed when 
he found out how the guts of his computer 
worked. “I learned how the central process- 
ing unit is constantly shuffling data back and 
forth. And I thought to myself, “Man! It really 
has to work like crazy!” He instinctively felt 
that computers needed a little more ‘Africa’ 
in their design, “something more distributed, 
more fluid and less rigid”. 
Today, as a bioengineer at Stanford Univer- 
sity in California, Boahen is among a small 
band of researchers trying to create this kind of 


BY M. MITCHELL WALDROP 


computing by reverse-engineering the brain. 
The brain is remarkably energy efficient and 
can carry out computations that challenge the 
world’s largest supercomputers, even though 
it relies on decidedly imperfect components: 
neurons that are a slow, variable, organic mess. 
Comprehending language, conducting abstract 
reasoning, controlling movement — the brain 
does all this and more in a package that is 
smaller than a shoebox, consumes less power 
than a household light bulb, and contains 
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nothing remotely like a central processor. 

To achieve similar feats in silicon, research- 
ers are building systems of non-digital chips 
that function as much as possible like networks 
of real neurons. Just a few years ago, Boahen 
completed a device called Neurogrid that emu- 
lates a million neurons — about as many as 
there are in a honeybee’s brain. And now, after 
a quarter-century of development, applications 
for ‘neuromorphic technology are finally in 
sight. The technique holds promise for any- 
thing that needs to be small and run on low 
power, from smartphones and robots to artifi- 
cial eyes and ears. That prospect has attracted 
many investigators to the field during the past 
five years, along with hundreds of millions of 
dollars in research funding from agencies in 
both the United States and Europe. 


RAMIN RAHIMIAN 


Kwabena Boahen 
holds a ‘neuromorphic’ 
circuit board from his 
Neurogrid device. 


Neuromorphic 
devices are also pro- 
viding neuroscien- 
tists with a powerful 
research tool, says 
Giacomo Indiveri at the Institute of Neuro- 
informatics (INI) in Zurich, Switzerland. By 
seeing which models of neural function do or 
do not work as expected in real physical sys- 
tems, he says, “you get insight into why the 
brain is built the way it is”. 

And, says Boahen, the neuromorphic 
approach should help to circumvent a looming 
limitation to Moore's law — the longstanding 
trend of computer-chip manufacturers manag- 
ing to double the number of transistors they 
can fit into a given space every two years or so. 
This relentless shrinkage will soon lead to the 
creation of silicon circuits so small and tightly 
packed that they no longer generate clean sig- 
nals: electrons will leak through the compo- 
nents, making them as messy as neurons. Some 
researchers are aiming to solve this problem 
with software fixes, for example by using sta- 
tistical error-correction techniques similar to 
those that help the Internet to run smoothly. 
But ultimately, argues Boahen, the most effec- 
tive solution is the same one the brain arrived 
at millions of years ago. 

“My goal is a new computing paradigm,” 
Boahen says, “something that will compute 
even when the components are too small to 
be reliable” 


SILICON CELLS 

The neuromorphic idea goes back to the 1980s 
and Carver Mead: a world-renowned pioneer 
in microchip design at the California Institute 
of Technology in Pasadena. He coined the term 
and was one of the first to emphasize the brain’s 
huge energy-efficiency advantage. “That's been 
the fascination for me,” he says, “how in the 
heck can the brain do what it does?” 

Mead’s strategy for answering that question 
was to mimic the brain’s low-power process- 
ing with ‘sub-threshold’ silicon: circuitry that 
operates at voltages too small to flip a stand- 
ard computer bit from a 0 to a 1. At those 
voltages, there is still a tiny, irregular trickle 
of electrons running through the transistors 
— a spontaneous ebb and flow of current that 
is remarkably similar in size and variability to 
that carried by ions flowing through a channel 
in aneuron. With the addition of microscopic 
capacitors, resistors and other components 
to control these currents, Mead reasoned, it 
should be possible to make tiny circuits that 
exhibit the same electrical behaviour as real 
neurons. They could be linked up in decen- 
tralized networks that function much like real 
neural circuits in the brain, with communica- 
tion lines running between components rather 
than through a central processor’. 

By the 1990s, Mead and his colleagues had 
shown it was possible to build a realistic sili- 
con neuron’ (see ‘Biological inspiration’). That 


device could accept outside electrical input 
through junctions that performed the role of 
synapses, the tiny structures through which 
nerve impulses jump from one neuron to the 
next. It allowed the incoming signals to build 
up voltage in the circuit’s interior, much as they 
do in real neurons. And if the accumulating 
voltage passed a certain threshold, the silicon 
neuron ‘fired, producing a series of voltage 
spikes that travelled along a wire playing the 
part of an axon, the neuron’s communication 
cable. Although the spikes were ‘digital’ in the 
sense that they were either on or off, the body of 
the silicon neuron operated — like real neurons 
— inanon-digital way, meaning that the volt- 
ages and currents werent restricted to a few dis- 
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to test models of brain function. In September 
2006, for example, Boahen launched the Neu- 
rogrid project: an effort to emulate a million 
neurons. That is only a tiny chunk of the 86 bil- 
lion neurons in the human brain, but enough 
to model several of the densely interconnected 
columns of neurons thought to form the com- 
putational units of the human cortex. Neuro- 
scientists can program Neurogrid to emulate 
almost any model of the cortex, says Boahen. 
They can then watch their model run at the 
same speed as the brain — hundreds to thou- 
sands of times faster than a conventional digital 
simulation. Graduate students and research- 
ers have used it to test theoretical models of 
neural function for processes such as working 


“WE ENVISION BUILDING FULLY AUTONOMOUS 
ROBOTS THAT INTERACT WITH THEIR 
ENVIRONMENTS IN A MEANINGFUL WAY.” 


crete values as they are in conventional chips. 

That behaviour mimics one key to the brain's 
low-power usage: just like their biological 
counterparts, the silicon neurons simply inte- 
grated inputs, using very little energy, until 
they fired. By contrast, a conventional com- 
puter needs a constant flow of energy to run 
an internal clock, whether or not the chips are 
computing anything. 

Mead’s group also demonstrated decentral- 
ized neural circuits — most notably ina silicon 
version of the eye’s retina. That device captured 
light using a 50-by-50 grid of detectors. When 
their activity was displayed on a computer 
screen, these silicon cells showed much the 
same response as their real counterparts to 
light, shadow and motion‘. Like the brain, 
this device saves energy by sending only the 
data that matters: most of the cells in the retina 
don't fire until the light level changes. This has 
the effect of highlighting the edges of moving 
objects, while minimizing the amount of data 
that has to be transmitted and processed. 


CODING CHALLENGE 
In those early days, researchers had their hands 
full mastering single-chip devices such as the sil- 
icon retina, says Boahen, who joined Mead’s lab 
in 1990. But by the end of the 1990s, he says, “we 
wanted to build a brain, and for that we needed 
large-scale communication”. That was a huge 
challenge: the standard coding algorithms for 
chip-to-chip communication had been devised 
for precisely coordinated digital signals, and 
wouldn't work for the more-random spikes 
created by neuromorphic systems. Only in the 
2000s did Boahen and others devise circuitry 
and algorithms that would work in this messier 
system, opening the way fora flurry of develop- 
ment in large-scale neuromorphic systems. 
Among the first applications were large-scale 
emulators to give neuroscientists an easy way 


memory, decision-making and visual attention. 

“Tn terms of real efficiency, in terms of fidel- 
ity to the brain’s neuronal networks, Kwabena’s 
Neurogrid is well in advance of other large- 
scale neuromorphic systems,” says Rodney 
Douglas, co-founder of the INI and co-devel- 
oper of the silicon neuron. 

But no system is perfect, as Boahen himself is 
quick to point out. One of Neurogrid’s biggest 
shortcomings is that its synapses — of which 
there is an average of 5,000 per neuron — are 
simplified connections that cannot be modi- 
fied individually. This means that the system 
cannot be used to model learning, which 
occurs in the brain when synapses are modi- 
fied by experience. Given the limited space 
available on the chip, squeezing in the complex 
circuitry needed to make each synapse behave 
in a more realistic manner would require 
circuit elements about a thousand times 
smaller in area than they are at present — in 
the realm of nanotechnology. This is currently 
impossible, although a newly developed class 
of nanometre-scale memory devices called 
‘memristors’ could someday solve the problem. 

Another issue stems from inevitable varia- 
tions in the fabrication process, which mean 
that every neuromorphic chip performs 
slightly differently. “The variability is still 
much less than what is observed in the brain,” 
says Boahen — but it does mean that programs 
for Neurogrid have to allow for substantial 
variations in the silicon neurons firing rates. 

This issue has led some researchers to 
abandon Mead’s original idea of using sub- 
threshold chips. Instead, they are using more 
conventional digital systems that are still 
neuromorphic in the sense that they mimic 
the electrical behaviour of individual neurons, 
but are more predictable and much easier to 
program — at the cost of using more power. 

A leading example is the SpiN Naker Project, 
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BIOLOGICAL INSPIRATION 


Neuromorphic technology is based on neurons and neural circuits in the brain. 
Like the brain, it uses much less power than standard computer chips. 


Synapses 
Each neuron has about 10,000 

of these tiny junctions, which 
receive signals in the form of 
voltage spikes coming in from 4 
other neurons. 


Neuron cell body 


In both real neurons and 
neuromorphic versions, 
voltages and currents vary 
smoothly rather than jump 
in digital fashion from one 
discrete value to another. 


+ 


Connections 


Each emulated neuron 
receives signals through 
several thousand of these 
links, which are often much 
simpler than biological 
synapses. 


Emulated neuron 


Both real and emulated 
neurons add up, or integrate, 
incoming signals until they 
pass a threshold and ‘fire’, 
producing an outgoing series 
of voltage spikes. 


This fibre, which can be up to 
1 metre long, transmits the 
voltage spikes to other neurons. 


This mimics the axon, and 
carries the voltage spikes to 
other emulated neurons. 
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led since 2005 by computer engineer Steve 
Furber at the University of Manchester, UK. 
This system uses a version of the very-low- 
power digital chips — which Furber helped 
to develop — that are found in many smart- 
phones. SpiNNaker can currently emulate up 
to 5 million neurons. These neurons are sim- 
pler than those in Neurogrid and burn more 
power, says Furber, but the system's purpose is 
similar: “running large-scale brain models in 
biological real time”. 

Another effort sticks with neuron-like chips, 
but boosts their speed. Neurogrid’s neurons 
operate at exactly the same rate as real ones. 
But the European BrainScaleS project, headed 
by former accelerator-physicist Karlheinz 
Meier at Heidelberg University in Germany, 
is developing a neuromorphic system that 
currently emulates 400,000 neurons running 
up to 10,000 times faster than real time. This 
means it consumes about 10,000 times more 
energy than equivalent processes in the brain. 
But the speed is a boon for some neuroscience 
researchers. “We can simulate a day of neural 
activity in 10 seconds,’ Meier says. 

Furber and Meier now have the money to 
push for bigger and better. Together they con- 
stitute the neuromorphic arm of the European 
Union's ten-year, €1-billion (US$1.3-billion) 
Human Brain Project, which was officially 
launched last month. The roughly €100 mil- 
lion devoted to neuromorphic research will 
allow Furber’s group to scale up his system 
to 500 million digital neurons; Meier’s group, 
meanwhile, is aiming for 4 million. 


The success of these research-oriented pro- 
jects has helped to stoke interest in the idea of 
using neuromorphic hardware for practical, 
ultra-low-power applications in devices from 
phones to robots. Until recently, that hadn't 
been a priority in the computer industry. Chip 
designers could usually minimize energy 
consumption by simplifying circuit design, or 
splitting computations over multiple processor 
‘cores that can run in parallel or shut down 
when they are not needed. 

But these approaches can only achieve so 
much. Since 2008, the US Defense Advanced 
Research Projects Agency has spent more 
than $100 million on its SYNAPSE project to 
develop compact, low-power neuromorphic 
technology. One of the project’s main contrac- 
tors, the cognitive computing group at IBM’s 
research centre in Almaden, California, has 
used its share of the money to develop digital, 
256-neuron chips that can be used as building 
blocks for larger-scale systems. 


BRAIN POWER 

Boahen is pursuing his own approach to 
practical applications — most notably in an 
as-yet-unnamed initiative he started in April. 
The project is based on Spaun: a design for a 
computer model of the brain that includes the 
parts responsible for vision, movement and 
decision-making. Spaun relies on a program- 
ming language for neural circuitry developed 
a decade ago by Chris Eliasmith, a theoretical 
neuroscientist at the University of Waterloo 
in Ontario, Canada. A user just has to specify 
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a desired neural function — the generation 
of instructions to move an arm, for example 
— and Eliasmith’s system will automatically 
design a network of spiking neurons to carry 
out that function. 

To see if it would work, Eliasmith and his 
colleagues simulated Spaun ona conventional 
computer. They showed that, with 2.5 million 
simulated neurons plus a simulated retina and 
hand, it could copy handwritten digits, recall 
the items in a list, work out the next number 
ina given sequence and carry out several other 
cognitive tasks*. That’s an unprecedented range 
of abilities by neural simulation standards, says 
Boahen. But the Spaun simulation ran about 
9,000 times slower than real time, taking 
2.5 hours to simulate 1 second of behaviour. 

Boahen contacted Eliasmith with the obvi- 
ous proposition: build a physical version of 
Spaun using real-time neuromorphic hard- 
ware. “I got very excited,’ says Eliasmith, for 
whom the match seemed perfect. “You've got 
the peanut butter, we've got the chocolate!” 

With funding from the US Office of Naval 
Research, Boahen and Eliasmith have put 
together a team that plans to build a small- 
scale prototype in three years and a full-scale 
system in five. For sensory input they will use 
neuromorphic retinas and cochleas developed 
at the INI, says Boahen. For output, they have a 
robotic arm. But the cognitive hardware will be 
built from scratch. “This is not a new Neuro- 
grid, but a whole new architecture,” he says. It 
will trade a certain amount of realism for prac- 
ticality, relying on “very simple, very efficient 
neurons so that we can scale to the millions” 

The system is explicitly designed for real- 
world applications. On a five-year timescale, 
says Boahen, “we envision building fully 
autonomous robots that interact with their 
environments in a meaningful way, and oper- 
ate in real-time while [their brains] consume as 
much electricity as a cell phone”. Such devices 
would be much more flexible and adaptive 
than today’s autonomous robots, and would 
consume considerably less power. 

In the longer term, Boahen adds, the project 
could pave the way for compact, low-power 
processors in any computer system, not just 
robotics. If researchers really have managed to 
capture the essential ingredients that make the 
brain so efficient, compact and robust, then it 
could be the salvation of an industry about to 
run into a wall as chips get ever smaller. 

“But we wont know for sure,’ Boahen says, 
“until we try.’ = 


M. Mitchell Waldrop is a features editor for 
Nature based in Washington DC. 
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DRAIN 
STORM 


Barack Obama 
announced his 
BRAIN Initiative on 
2 April. Ever since, 
neuroscientists 
have been 
scrambling to 

work out what it 
actually is. 


BY HELEN SHEN 


mixture of excitement, hope and anxiety 

made for an electric atmosphere in the 

crowded hotel ballroom. On a Monday 

morning in early May, neuroscientists, 

physicists and engineers packed the room 

in Arlington, Virginia, to its 150-person 
capacity, while hundreds more followed by 
webcast. 

Only a month earlier, US President Barack 
Obama had unveiled the neuroscience equiva- 
lent of a Moon shot: a far-reaching programme 
that could rival Europe’s 10-year, €1-billion 
(US$1.3-billion) Human Brain Project (see 
page 5). The US Brain Research Through 
Advancing Innovative Neurotechnologies 
(BRAIN) Initiative would develop a host of 
tools to study brain activity, the president 
promised, and lead to huge breakthroughs in 
understanding the mind. 

But Obama’s vague announcement on 
2 April had left out key details, such as what the 
initiative’s specific goals would be and how it 
would be implemented. So at their first oppor- 
tunity — a workshop convened on 6 May by 
the National Science Foundation (NSF) and 
the Kavli Foundation of Oxnard, California 
— researchers from across the neuroscience 
spectrum swarmed to fill in the blanks and 
advocate for their favourite causes. 

The result was chaotic, acknowledges Van 
Wedeen, a neurobiologist at Harvard Medical 
School in Boston, Massachusetts, and one of 
the workshop’s organizers. Everyone was afraid 
of being left out of ‘the next big thing’ in neuro- 
science — even though no one knew exactly 
what that might be. “The belief is we're ready 
for a leap forward,’ says Wedeen. “Which leap, 
and in which direction, is still being debated” 

Others describe the BRAIN Initiative as a 
Rorschach test — an indeterminate entity that 
invited each researcher to project his or her 
own hopes and insecurities. But as the initia- 
tive has evolved, it has also come to resemble 
a large-scale sociological experiment, as the 
sprawling neuroscience community struggles 
to coalesce around a common research plan 
under intense public scrutiny and tough finan- 
cial constraints. 


A BIG PICTURE 

To the public, Obama's announcement seemed 
to come from nowhere; the president had 
never focused much on neuroscience before. 
In fact, the idea behind it had been spawned 
some 18 months earlier and almost 6,000 kilo- 
metres from the White House. At a meeting in 
Chicheley, UK, a group of neuroscientists and 
nanoscientists invited by the Kavli Founda- 
tion had developed their vision for the future 
of neuroscience research: to record electrical 
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impulses from thousands, or even millions, of 
neurons at once. 

That is the only way in which we might 
understand how thought emerges from the 
brain, argues Rafael Yuste, a neuroscientist at 
Columbia University 
in New York City who 
spearheaded the idea. 
Current technology 
can make recordings 
from only single neu- 
rons or small groups 
of neurons at a time 
— which, he says, “is 
like trying to watch a 
movie on TV by look- 
ing at one pixel”. 

To do better, the 
architects of the Kavli 
plan called for a Brain 
Activity Map (BAM) 
project: a technology- 
development pro- 
gramme that would 
give researchers the 
tools to start small, 
produce detailed maps of neural activity in 
simple organisms such as the fruitfly, and then 
move on to larger, more complex mammalian 
systems such as the mouse retina. They pre- 
dicted that, within 15 years, BAM would be 
able to simultaneously record all of the activity 
in a mouse cortex — and that primates, and 
even humans, would be next. 

BAM was intended to be provocative. “This 
is not going to happen if we keep waiting on 
little labs to do little things,’ declares Yuste. But, 
for many outsiders, it was ill-conceived — “a 
complete work of science fiction’, says Markus 
Meister, a neurobiologist at the California Insti- 
tute of Technology in Pasadena. Critics argued 
that the effort would take too long, cost too 
much and, ultimately, run up against the laws of 
physics, which limit how densely electrodes can 
be packed inside the brain. Moreover, creating a 
full activity map covering an organism's entire 
lifetime could yield a cripplingly large data set, 
while distracting from what many saw as the 
real problem: a dearth of computational and 
theoretical methods with which to interpret 
the brain’s activity. “We just don’t understand 
the data we have,’ says Mehrdad Jazayeri, a 
neuroscientist at the Massachusetts Institute 
of Technology in Cambridge. 

But BAM caught the attention of administra- 
tors at the White House, who were on the look- 
out for a bold presidential initiative (see Nature 
495, 19; 2013). The first hint of the administra- 
tion’s interest appeared in the president's State of 
the Union address on 12 February. Few neuro- 
scientists appreciated the significance until five 
days later, when they were jolted awake by an 
article on the front page of The New York Times, 
which reported that the White House planned 
to unveil a ten-year neuroscience initiative 


Some 

describe the 
initiative as a 
Rorschach test, 
inviting each 
researcher 

to project his 
or her own 
hopes and 
insecurites. 
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based on the Kavli idea. The article suggested 
that the initiative might receive federal fund- 
ing on par with the $3.8 billion spent on the 
Human Genome Project, and would produce a 
comprehensive, detailed map of neural activity 
in the human brain within a decade. 

The news alarmed many neuroscientists, 
who worried that few of them had been con- 
sulted, that the money would be made available 
at the expense of existing programmes and that 
failure to meet a seemingly impossible goal 
would undermine public trust in science. “This 
was a very narrow agenda ofa small group of 
people,’ recalls Partha Mitra, a neuroscientist 
at Cold Spring Harbor Laboratory in New York 
and a vocal critic of BAM. 

But the administration was already tak- 
ing a different tack. By the time of the official 
announcement on 2 April, the project had 
been rebranded the BRAIN Initiative and car- 
ried a comparatively modest price tag: only 
$110 million in federal funding for the 2014 
fiscal year. It no longer had a specific lifetime 


— although the White House implied that the 
project could last ten years or longer. 

And, unlike BAM, it had no clearly defined 
goal. Rather than promising to record from 
any particular number of neurons at once, 
Obama said simply that new tools were needed 
to help neuroscientists to develop better pic- 
tures of brain circuits in action — and that such 
technologies could pave the way to treatments 
for neurological disorders such as epilepsy, 
autism, Alzheimer’s disease and schizophrenia 
(see Nature 499, 272-274; 2013). 

Many neuroscientists found the announce- 
ment reassuring — at least the BRAIN Ini- 
tiative wasn’t BAM — but puzzlingly vague. 
All they knew was that the details would be 
left up to three government agencies: the 
Defense Advanced Research Projects Agency 
(DARPA), which would contribute $50 mil- 
lion in the first year; the National Institutes of 
Health (NIH), which would pitch in $40 mil- 
lion; and the NSE which would add $20 mil- 
lion. The initiative would be further supported 


FEATURE | NEWS 


by four private institutions, which had com- 
mitted to a total of $122 million over varying 
lengths of time (see ‘Obama's BRAIN’). 

Thus the crush at the NSF’s May workshop: 
after weeks of uncertainty, researchers were 
hungry for a chance to weigh in. Participants 
were asked to submit one-page proposals 
describing a major obstacle to understand- 
ing the brain. Then, in a frenetic pitch-fest, 
authors took the floor for one minute each to 
argue their cases. 

“What I care most about is reconstructing 
circuits accurately and fast,’ declared the first 
speaker, Albert Cardona. A neuroscientist at 
the Janelia Farm Research Campus near Ash- 
burn, Virginia, he pushed for improved auto- 
mated techniques to map the brain’s anatomy 
ona super-fine scale. Others called for equally 
fine-grained recordings from ever-larger 
numbers of neurons, in the spirit of BAM. 
Still others championed their favourite model 
organisms. And some speakers emphasized 
the importance of big-data storage, as well as 
the computational and theoretical advances 
required to make sense of all that information. 


BLURRED VISION 

To the growing exasperation of audience 
members, however, there was no convergence 
towards a coherent agenda for the initiative. 
No one could even say whether the initiative 
would be funded with new cash outlays or with 
money diverted from existing research. By the 
meeting’s end, the hotel lobby had become 
crowded with restless attendees who had aban- 
doned the talks to check e-mails, make phone 
calls and run their labs from afar. 

Among those who did stay were members 
of the NIH’s BRAIN Initiative advisory com- 
mittee, a 15-member panel dubbed the ‘dream 
team’ — a nickname it has since tried, unsuc- 
cessfully, to shake off. Co-chaired by neuro- 
scientists Cornelia Bargmann at the Rockefeller 
University in New York City and William 
Newsome of Stanford University in California, 
the panel’s first task was to prepare an interim 
report outlining the NIH’s science goals for the 
project's first year. Then, once that report had 
been delivered to the NIH in September, the 
team would start to develop a long-term imple- 
mentation plan, due in June 2014. 

Shortly after the NSF meeting, the NIH team 
started on its first order of business: convening 
a series of four workshops to gather input from 
the neuroscience community. These covered 
molecular techniques; large-scale recording 
technologies; computational and theoreti- 
cal neuroscience; and human brain studies. 
The difference in tone was striking. The NSF 
event had been like a cacophonous town hall 
meeting, whereas the NIH workshops felt 
more like an honorary lecture series. Each one 
began with public presentations by a dozen or 
so invited speakers, and the proceedings were 
carefully controlled. Once the open session 
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OBAMA'S BRAIN 


The White House has set lofty 
objectives for its BRAIN Initiative. 
Now itis up to the participants 
(purple) to develop a strategy 
for the programme. 


PRIVATE RESEARCH 


e The Allen Institute for Brain Science 
$60 million annually 


e Howard Hughes Medical Institute 
$30 million annually 


e Kavli Foundation 
$4 million annually for 10 years 


© Salk Institute for Biological Studies 
$28 million 


FEDERAL AGENCIES 


(First year funding) 


e Defense Advanced Research Projects Agency 
$50 million 


e National Institutes of Health 
$40 million 


e National Science Foundation 
$20 million 


was over, all of the speakers disappeared into 
closed discussions with the dream team. One 
participant likened that experience to being a 
delegate to the United Nations, with everyone 
seated around a long oval table behind printed 
name cards. 

Bargmann says that privacy was necessary 
to allow scientists to speak freely — and some- 
times critically — about different experimen- 
tal approaches. But, in the wider neuroscience 
community, many felt the selection of invited 
speakers and topics excluded their interests. The 
dream team weathered criticism from molecu- 
lar, cellular and developmental neuroscientists 
who felt underrepresented, as well as from clini- 
cal neuroscientists concerned that there was not 
enough emphasis on disease research. 

Adding to researchers’ anxiety was the fact 
that no one knew whether they would have 
any involvement in the arms of the project 
being run by the two other federal agencies. 
The deputy director of DARPA’ defence sci- 
ence office, Geoffrey Ling, said in June that his 
agency would not be releasing any road maps 
for its BRAIN Initiative efforts; meanwhile, 
the NSF’s lead on the project, biological sci- 
ences chief John Wingfield, said in September 
that the agency intended to wait for the NIH 
report before issuing its own plan, to avoid 
duplication. “There are limits to what we can 
do, he said, contrasting his agency’s roughly 
$150-million annual expenditure on neuro- 
science with the NIH’s $5.5-billion budget. 

Meanwhile, almost everyone was worried 
about where the funding would come from 
— especially as it became clear that Congress 
would not set aside any new money for the 
BRAIN Initiative’s first year. A modicum of 


e Provide the 
knowledge for 
addressing debilitating 
disorders. 


e Develop new 


imaging technologies 
and understand how 
information is stored 
and processed in 
neural networks. 


OBJECTIVES 


e Understand how 
brain activity leads to 
perception, decision- 
making and, ultimately, 
action. 


e Producea 
sophisticated 
understanding of the 
brain, from individual 
genes to neuronal 
circuits to behaviour. 


RESEARCH eOMMUNITY 


e The National Science Foundation convened 
workshops to solicit ideas. 


e The National Institutes of Health set up a 
‘dream team’ of 15 scientists that is producing 
reports based on community feedback. 


new funding should be forthcoming from the 
NIH: of the $40 million it agreed to commit, 
$10 million will come from the director’s dis- 
cretionary funds. Officials at the NIH and NSF 
maintained that the initiative would not derail 
existing programmes. But the dearth of dedi- 
cated new funds meant that the three federal 
agencies would have to begin, at least in part, 
by packaging together some ongoing projects. 
The initiative’s private partners, likewise, will 
mostly stick with existing programmes. The 
Salk Institute for Biological Studies in La Jolla, 
California, the Allen Institute for Brain Sci- 
ence in Seattle, Washington, and the Howard 
Hughes Medical Institute in Chevy Chase, 
Maryland, were all eager to frame the BRAIN 
Initiative as a continuation of research they 
already had under way. 


SOMETHING FOR EVERYONE 

By early September, with the private partners 
determined to do their own thing and two of 
the three federal agencies all but silent, inter- 
est in the NIH report had reached fever pitch: 
researchers saw it as the de facto national 
agenda. On 16 September, the advisory com- 
mittee at last published its interim report on 
science priorities. Many had feared that it 
would fail to be sufficiently inclusive, but the 
document was instead so staggeringly broad 
that it seemed to encompass all of circuit-based 
neuroscience. Cataloguing every cell type in 
the brain, mapping those cells’ full anatomical 
connections, monitoring and manipulating 
their signals, modelling and simulation — 
there was something for everyone. “It would be 
hard to disagree with this report,’ said Mitra. 
“It’s written, perhaps, with critics in mind.” 


28 | NATURE | VOL 503 | 7 NOVEMBER 2013 


© 2013 Macmillan Publishers Limited. All rights reserved 


NIH director Francis Collins tacitly con- 
ceded the report’s vastness when he formally 
accepted it. “These areas of research are expan- 
sive, and undoubtedly cover more research than 
NIH can fund with $40 million in one year,’ he 
said; the more ambitious elements could shape 
funding requests in years to come. And not just 
afew years, adds Yuste. “This is something you 
need 15 years and $3 billion to do” 

Tough choices lie ahead as the committee 
starts work on the long-term report it must 
deliver next June. The team will need to rank 
research priorities as short-, medium- and 
long-term goals, set timelines, estimate costs 
and define specific deliverable outcomes for 
the next few years — altogether a daunting 
task, says Newsome. 

“It’s much easier to see a year into the future 
than ten years into the future,’ agrees Barg- 
mann. 

Already, members of the working group are 
butting heads over questions such as whether 
ultra-detailed anatomical maps of the brain — 
painstakingly obtained with electron micros- 
copy (see page 147) — should take priority over 
lower-resolution maps that can be completed 
much more quickly using light microscopy. 

And then there is the question of manage- 
ment. Although the three government agencies 
have kept each other informed of their plans, 
the White House has so far indicated no inten- 
tion to coordinate the process more formally. 
This worries researchers such as Yuste, who 
is urging the creation of ‘brain observatories’: 
multi-agency facilities that, like particle accel- 
erators or giant telescopes, could provide com- 
munity access to technology too large, costly 
or specialized for individual labs to maintain. 
Without higher-level planning, he warns, such 
efforts will be impossible, and the BRAIN Ini- 
tiative’s investment could end up being squan- 
dered on many small grants awarded by the 
individual agencies. “The whole effort will not 
be more than the sum of its parts,” he warns. 

But others, including Bargmann, argue 
against throwing limited resources behind a 
monolithic, centralized project. “This is not 
the time to pick one approach and say this 
is the right approach,” she says. Instead, she 
hopes to foster the strongest and most creative 
ideas from individuals and groups of research- 
ers — and see where they lead. 

The NIH advisory committee hopes to draw 
on the creativity of the wider neuroscience 
community at the upcoming annual meeting 
of the Society for Neuroscience. On 11 Novem- 
ber, hundreds of neuroscientists are expected 
to pile into room 33C at the San Diego Con- 
vention Center in California to weigh in on the 
NIH’s interim recommendations. Armed with 
only a slightly more defined vision than they 
had six months ago, they will continue to try 
to define what the BRAIN Initiative can and 
should mean for their future. m 


Helen Shen is a reporter for Nature. 
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My life with Parkinson’s 


A neuroscientist reflects on his experience of studying the circuits that 
control neural activity while his own brain began slowly failing him. 


oughly a year ago, I found myself at 
R= elegant dinner party filled with 
celebrities and the very wealthy. Iam 
a young professor at a major research univer- 
sity, and my wife and I were invited to mingle 
and chat with donors to the institution. To 
any outside observer, my career was ascend- 
ant. Having worked intensely and passion- 
ately at science for my entire adult life, I had 
secured my dream job directing an inde- 
pendent neuroscience research laboratory. 
I was talking to a businessman who had 
family members affected by a serious medi- 
cal condition. He turned to me and said: 
“You're a neuroscientist. What do you know 
about Parkinson's disease?” 


My gaze darted to catch the eyes of my 
wife, but she was involved in another con- 
versation. I was on my own, and I paused 
to gather my thoughts before responding. 
Because I had a secret. 

It was a secret that I hadn't yet told any of 
my colleagues: I have Parkinson's. 

Iam still at the beginning of my fascinat- 
ing, frightening and ultimately life-affirming 
journey as a brain scientist with a disabling 
disease of the brain. Already it has given me 
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a new perspective on my work, it has made 
me appreciate life and it has allowed me to 
see myself as someone who can make a dif- 
ference in ways that I never expected. But it 
took a bit of time to get here. 


THE FIRST SIGNS 

I remember the first time I noticed that 
something was wrong. Four years ago, I was 
filling out a mountain of order forms for new 
lab equipment. After a few pages, my hand 
became a quaking lump of flesh and bone, 
locked uselessly in a tense rigor. A few days 
later, I noticed my walk was changing: rather 
than swinging my arm at my side, I held it 
in front of me rigidly, even grabbing the 
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> bottom edge of my shirt. I also had an 
occasional twitch in the last two fingers of 
my hand. 

I was 36 years old and it was the most ter- 
rifying time of my life, even without any of 
these mysterious symptoms. In the span of 
six months, I found myself in a job that I had 
spent 20 years preparing for, I becamea father 
for the second time, I moved across the coun- 
try to a town where we knew no one, and I 
was working alone in an empty lab wondering 
who left me in charge. I study the way that 
neuromodulatory chemicals such as dopa- 
mine affect neural activity and behaviour. 
And now, my own brain chemistry was rebel- 
ling against me. 

I considered many possibilities. A brain 
tumour? Dystonia? Motor neurone disease? 
Huntington's disease? Multiple sclerosis? 
Was I just stressed out? 

My diagnosis came from a young neu- 
rology fellow at one of the world’s leading 
centres for the study of movement disorders. 
He felt more like a peer than an authority 
figure. He, too, spent a lot of time in the lab 
doing basic research and published papers in 
some of the same journals as me; we could 
have just as easily run into each other at a 
scientific meeting. Asa result, the experience 
of my diagnosis was oddly collegial. 

Right away, I wondered how long I could 
get away without telling my colleagues. I wor- 
ried that I would be less likely to get the grants 
I needed to run my lab if the reviewers were 
not confident about investing in my future. 
I wondered whether students and postdocs 
would be afraid to join my research group. 
And, perhaps most importantly, how long I 
would be able to do experiments — the thing 
that I most love. Stiffness, shaking, fatigue, 
jerky movements, falls, drooling, laboured 
speech and the expressionless Parkinsonian 
mask. These could all be a part of my future. 


MIND MATTERS 

I was diagnosed with Parkinson's more than 
two years ago. From that day, I have hada 
different relationship with the brain — my 
scientific focus for the past 20 years. I now 
know what it is like to have a brain disorder 
and can explore its manifestations first hand. 
Take the very peculiar symptom known as 
‘freezing. Occasionally, when I attempt to lift 
my hand it well... won't. Notice that I didn't 
say can't. There is nothing wrong with my 
arm. It is still strong and capable of moving, 
but I have to put effort, even focus, into get- 
ting it to move — frequently to such a degree 
that I have to pause whatever else my brain is 
doing (including talking or thinking). Some- 
times, when no one else is around, I use my 
other hand to move it. 

As a neuroscientist, it is simultaneously 
fascinating and terrifying to be directly con- 
fronted with the intersection of the neuro- 
physiological and philosophical constructs 


of ‘will. The way my mind and body do battle 
forces me to reconsider the homunculus, a 
typically pejorative (among neuroscientists) 
caricature ofa little man pulling levers inside 
our heads, reading the input and dispatch- 
ing the output. Virtually all that we know 
about how the brain is organized belies this 
image, and yet there is a dualism to my daily 
experience. 
Parkinson’, particularly in young people, 
is primarily a disorder of motor control, not 
of cognition. Still, my 


“Tt does experience, however 
affect the limited, leads me to 
day-to-day speculate about what 
mechanics of it is like to be trapped 
experiments bya brain gone rogue. 
forme.” When one begins 


to lose the ability to 
interact with the world, and when one’s 
faculties for clear perception and cognition 
are stripped away, what remains of the con- 
scious self? 

This brings me to one of the main reasons 
that I have kept my disease secret: the stigma 
of ‘mental illness: Because most people do not 
understand Parkinson’, it may be confused 
with cognitive disorders such as schizophre- 
nia and Alzheimer’s disease. I feel as sharp 
and productive as ever, but I wasn’t sure 
that others would have faith in me at a time 
when my career is so fragile. So nearly every 
moment of my life became a performance, in 
which I tried to hide my symptoms. At work, 
at the grocery store, in my front yard, even in 
front of my kids — I am always keenly aware 
of my movements. And nowhere more so 
than at scientific conferences, such as at meet- 
ings of the Society for Neuroscience (SfN). 
You may not notice where my hands are, but 
I do. Often, I am sitting on them. 

Does Parkinson's affect the way I do 
science? It does affect the day-to-day 
mechanics of experiments for me. The tech- 
niques used in my lab require considerable 
motor skill at times. I have had to modify 
how I do some things, including taking more 
time, compensating with my good hand or 
using a different grip on instruments. Still, 
it is pretty remarkable how capable I remain 
at the bench. The lesson in this for me is that 
‘lab hands’ are more about experience, atten- 
tion to detail and adaptation of methods than 
they are about raw dexterity. And the low- 
to-moderate doses of drugs that I take really 
help, as do sleep and exercise. By all indica- 
tions, I will be able to continue research for 
many years, perhaps even indefinitely. 

There is also the question of how this 
diagnosis affects my scientific direction. I 
am sometimes asked whether I will wholly 
or partly switch to studying Parkinson’s dis- 
ease. I suppose I might if the right project 
came along, but in general I remain focused 
on the questions that I have already set 
out for myself. I am also sometimes asked 
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whether my diagnosis makes me impatient 
with the pace of discovery of cures. Here my 
answer is very clear. The dual perspectives of 
my condition and my position as an active 
researcher actually reinforce my belief in the 
importance of discovery science. Iam keenly 
aware that those cures are possible only in 
the wake of decades of basic research. Above 
all, my diagnosis makes me want to do the 
best and most exciting science I can, because 
the privilege could disappear for any of us in 
the blink ofan eye. 


TO TELL OR NOT TO TELL 

Back at the dinner party, all eyes were on me 
waiting to hear my thoughts on Parkinson’. I 
wanted to tell my colleagues what I was going 
through. I wanted to look at our donor and 
say: “Funny you should ask that. Not only 
am I a neuroscientist, but I also have Par- 
kinson’s disease.’ I wanted to launch into an 
eloquent monologue that put a personal face 
on the science of neurodegenerative disease. 
I wanted to conclude by saying, “And that is 
why basic brain research is so important.” 

But I didn’t. 

Instead, I dispassionately described the 
pathology and characteristic symptoms of 
Parkinson's. It was an intellectually engag- 
ing exchange, but it wasn’t the conversation 
it could have been. This is one of the main 
reasons I decided to stop hiding. 

Earlier this year, I told my department 
chair. Over the next few days, I told the 
administration, my lab and many of my col- 
leagues. It took a lot out of me, but it ended up 
being one of the best decisions I ever made. 
Everyone at work was so supportive — I felt 
silly for having spent four years, since the 
onset of my symptoms, worrying about how 
they would react. In the subsequent months, 
it has become a non-issue for me in how I 
interact at work. Everyone treats me like any 
other colleague, and it is such a relief not to 
worry about who knows anymore. It is still 
uncommon for me to tell someone new, but I 
do not do anything to hide my condition, just 
enough to not call attention to it. For anyone 
reading this who is going through something 
similar, I am here to tell you that life is too 
short to run from who you are. Your col- 
leagues might surprise you, and you can still 
bea great scientist despite a disability. 

So why am I writing this piece anony- 
mously? Because I don't want to be known 
to the scientific community as ‘Parkinson's 
guy’ before Iam known as a scientist. That 
said, I'm not hiding any more, so if you care 
you can dig enough to find out who I am. 
I’m okay with that. m 


The author is a neuroscience professor 
at a major US university. He blogs at 
parklifensci.blogspot.com and tweets at 
@Parklifensci. 

e-mail: parklifensci@gmail.com 
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Study neuron 
networks to tackle 
Alzheimer’s 


Researchers have generally looked for signs of 
Alzheimer’s disease in the whole brain or at the single- 
gene level. The biggest clues will come from monitoring 

collections of neurons, says Kenneth S. Kosik. 


wenty years of research and more 
| than US$1-billion worth of clinical 
trials have failed to yield an effec- 
tive drug treatment for Alzheimer’s disease. 
Most neuroscientists, clinicians and drug 
developers now agree that people at risk of 
the condition will probably need to receive 
medication before the onset of any cogni- 
tive symptoms. Yet a major stumbling block 
for early intervention is the absence of tools 
that can reveal the first expression of the 
insidious disease. 

So far, researchers have tended to focus 
on macroscopic changes associated with 
the disease, such as the build up of insolu- 
ble plaques of protein in certain areas of the 
brain, or on individual genes or molecular 
pathways that seem to be involved in disease 
progression. 

I contend that detecting the first dis- 
ruptions to brain circuitry, and tracking 
the anatomical and physiological damage 
underlying the steady cognitive decline that 
is symptomatic of Alzheimer’s, will require 
tools that operate at the ‘mesoscopic scale: 
techniques that probe the activity of thou- 
sands or millions of networked neurons. 
Although such tools are yet to be realized, 
several existing technologies indicate that 
they are within reach. 


CHARTED TERRITORY 

All the current approaches that are used 
to diagnose Alzheimer’s are crude and 
unreliable. Take the classic biomarkers of the 
disease: a build up of plaques of the protein 
B-amyloid in a person’s cerebral cortex, for 
instance, or elevated levels of the tau protein 
and dampened levels of 6-amyloid in their 
cerebrospinal fluid. Although such mark- 
ers are predictive of the disease, the interval 
between their appearance and the onset of 
cognitive problems is hugely variable, rang- 
ing from months to decades. 

Frequently, people with a high density 
of plaques or an ominous cerebrospinal 
fluid reading show no signs of dementia in 
behavioural tests. Others show the classic 
symptoms of the disease, such as memory 
loss, confusion and inability to formulate a 
simple plan. 

More consistently linked to cognitive 
difficulties are neurofibrillary tangles — 
aggregates of tau protein commonly found 
in the hippocampus and amygdala (the 
parts of the brain involved in memory and 
emotion) of people with Alzheimer’s, and 
in the cerebral cortex. Yet even this correla- 
tion is imperfect. And unlike for amyloid 
plaques, which can be monitored using 
a scanning technique known as positron 
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emission tomography (PET), no imaging 
procedure is yet available to detect tangles 
in living people. 

So although all these markers can be har- 
bingers of disease, none of them captures 
the subtle disruptions to brain circuitry 
that mark the onset of cognitive decline 
or enable researchers to track disease pro- 
gression from week to week and month to 
month. Nor do they reveal anything about 
the even shorter-term fluctuations in cogni- 
tive capacity frequently observed in people 
with Alzheimer’s. Relatives and carers often 
report, for instance, that for brief periods, a 
person with Alzheimer’s might behave nor- 
mally and be able to engage in conversation, 
and then an hour later be unable to remem- 
ber what just transpired. 

Aside from biomarkers, clinicians use 
reams of neuropsychological tests to 
diagnose Alzheimer’s disease. These test 
a person’s memory, emotional responses, 
language skills, and ability to solve prob- 
lems or count. However, such tests fail to 
capture the subtleties of the cognitive defi- 
cits at every stage of the disease. For exam- 
ple, asking someone to recall a recently 
viewed list of items will not reveal how well 
they are able to pull together memory frag- 
ments to make appropriate decisions and 
predictions. Similarly, being able to join 
up sequentially numbered dots (a standard 
test of executive function) is a far cry from 
mentally mapping out the steps needed to 
achieve some purpose in daily life, such as 
food shopping. 


IMAGING ISSUES 
One of the various types of brain imaging 
commonly used to probe and diagnose Alz- 
heimer’s is functional magnetic resonance 
imaging (fMRI). This tracks changes to 
blood oxygenation in the brain while a per- 
son is resting or doing certain cognitive tasks. 
The aim is to reveal faulty wiring or differ- 
ences in regional brain activity from what is 
usually observed. However, standard {MRI 
is limited to scanning the 86 billion human 
brain neurons and several thousand times 
more synapses with fewer than 20,000 voxels 
(volumetric pixels). With this technique, each 
voxel corresponds to an arbitrary volume of 
tissue rather than to a specific brain network 
that mediates cognitive processes. PET, which 
assesses blood flow and metabolism, as well as 
the location and density of amyloid plaques, 
has an even lower spatial resolution. 

Furthermore, the current tools used to 
diagnose Alzheimer’s disease and track its 
progression do not account for innate or 
acquired individual differences in brain 
structure that can drastically alter people’s 
tolerance of brain pathology. 

Numerous studies have suggested, for 
instance, that education and higher intellec- 
tual achievement, such as greater proficiency 


in languages, writing and speaking, can 
protect people from clinical Alzheimer’s dis- 
ease (with which people show severe cogni- 
tive deficits in behavioural tests). For example, 
a long-running study of nuns, which began 
in 1986, found that those diagnosed with the 
disease through post-mortem examinations 
had, decades before their death, displayed 
inferior language skills in application letters 
sent to convents compared with those with- 
out the disease’. This may help to explain why 
people with similar amyloid burdens in their 
cerebral cortex can show such dramatic dif- 
ferences in cognitive performance’. It also 
highlights why researchers should avoid 
evaluating the efficacy of a drug on its ability 
to break down amyloid plaques alone. 


UNCHARTED TERRITORY 

Several lines of evidence suggest that net- 

works of neurons in the brain operate as local 

processing units, with few long-range con- 

nections between them — an organizational 

pattern known as small world’. This evidence 
includes partial wir- 


“Researchers — ing diagrams of the 
should avoid fly and mammalian 
evaluating brain obtained from 
the efficacy anatomical tracings, 
ofa drug on circuit physiology 
its ability to and by reconstruct- 
break down ing nanometre-thick 
amyloid slices of brain tissue. 
plaques Optical-imaging tech- 
alone.” niques used to track 


neural activity across 
the entire brain in organisms such as zebrafish 
larvae also support this idea’. 

I believe that tools to analyse how neuron 
networks operate in the human brain will 
be crucial to probing the changes to brain 
circuitry underlying cognitive impairment 
in Alzheimer’s disease. One possibility is 
using minimally invasive ‘nanosensors’ 
that can travel to the brain through blood 
vessels and communicate neural activity”. 
Diamond-based materials are currently 
being developed to track the movements of 
single electrons across nerve membranes. 

Even now, several techniques could 
offer preliminary insights about the col- 
lective workings of different regions of the 
human brain. For example, particularly 
powerful MRI systems can scan the entire 
living human brain at a resolution of about 
0.7 cubic millimetres (most are limited to 
1-mm’ resolution)®. 

Meanwhile, researchers have used 
another imaging technique known as 
magnetoencephalography (MEG) — which 
maps neural activity by recording the mag- 
netic fields produced by electrical currents 
in the brain — to predict the development 
of preclinical Alzheimer’s disease, known 
as mild cognitive impairment, in 5 out 
of 15 people’. People with this condition 
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experience difficulties that are noticeable to 
them and to others but that are not severe 
enough to interfere with daily life. MEG has 
a poorer spatial resolution than MRI, but 
records responses in milliseconds rather 
than in seconds or minutes, capturing more 
of the numerous intermediary cognitive steps 
involved in mediating a person’s response toa 
picture, word or task. 

Likewise, molecular analyses could shed 
light on brain activity at the mesoscopic scale. 
Sequencing the RNA molecules expressed in 
different regions of the brain, in tissues taken 
from people who have died, has revealed co- 
regulated networks of genes associated with 
late-onset Alzheimer’s’. Of enormous value 
would be detailed maps showing the genes 
expressed, where they are expressed and how 
they relate to neural networks — as have been 
obtained for the sea urchin”. 


LOOKING AHEAD 

Such mesoscopic-scale analyses are pre- 
cisely what the US BRAIN (Brain Research 
through Advancing Innovative Neurotech- 
nologies) Initiative is about. Announced by 
US President Barack Obama in April, the 
aim of this project is to map and monitor 
the neural connections in the entire brains 
of experimental animals, and ultimately in 
the human cerebral cortex. 

Several influential neuroscientists have 
questioned whether an understanding of 
consciousness, perception, imagination, 
emotion and abstraction will emerge from 
measuring brain activity at the mesoscopic 
scale. I am similarly sceptical of such lofty 
claims. Yet I applaud the BRAIN Initiative 
for its potential to advance Alzheimer’s dis- 
ease research. Monitoring neuron networks 
is exactly what neuroscientists need to better 
understand, and ultimately to treat, a disease 
that is now costing the United States alone 
upwards of $200 billion every year. m 


Kenneth S. Kosik is at the Neuroscience 
Research Institute, and professor in the 
Department of Cellular, Molecular and 
Developmental Biology at the University 
of California, Santa Barbara, California 
93106, USA. 

e-mail: kenneth.kosik@lifesci.ucsb.edu 
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HUMAN EVOLUTION 


Us and them 


Tim Radford contemplates three fascinating studies on what it means to be human. 


sang the poet Alexander Pope. Of 

course, he knew nothing about tool- 
using chimpanzees, language-manipulating 
gorillas, self-recognizing orang-utans or 
problem-solving New Caledonian crows. Nor 
had he heard of extinct hominins, hunter- 
gatherer energy budgets, Palaeolithic arte- 
facts, the significance of sweat glands or the 
evolutionary rewards of cooking, bipedalism 
and long-distance running. 

Still, when he wrote in his 1732 Essay on 
Man abouta being “placed on this isthmus 
of a middle state, a being darkly wise, and 
rudely great’, he demonstrated what Thomas 
Suddendorf, author of The Gap, calls the 
“two foundational capacities” (think of 


CC Ts proper study of mankind is mar’, 


The Gap: The Science of What Separates Us 
from Other Animals 
THOMAS SUDDENDORF 


Basic Books: 2013. 


The Story of the Human Body: Evolution, 
Health and Disease 

DANIEL E. LIEBERMAN 

Pantheon: 2013. 


The Accidental Species: Misunderstandings 
of Human Evolution 

HENRY GEE 

University of Chicago Press: 2013. 


them as legs) that help us to stride the divide 
between ape and human minds. Humans 
share the capacity to compose scenarios 
and nest them within each other, and the 
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urge to communicate. Other social animals 
co-operate, deliver alarm calls and trade 
information. But none seems to match the 
human drive to link minds, to travel men- 
tally in time, to tell stories, stir emotions, ask 
questions, compose poems or write books 
about what it means to be human. 

This certainly looks to be the reason why 
Homo sapiens now constitutes eight times 
the biomass of all other wild terrestrial ver- 
tebrates combined. And it also sheds light 
on why one species, 


initially formed by the OD NATURE.COM 
blind forces ofnatural See Nature's essay 
selection acting upon _ series on being 
random mutation, humanat: 

might be about to step _go.tiature.com/khik9 


ILLUSTRATIONS BY PHIL DISLEY 


off the evolutionary treadmill altogether, 
and take control of its own future. That 
three authors — Suddendorf, Henry Gee and 
Daniel Lieberman — can take the same 
theme, address the same research and cite 
the same authorities to deliver three very 
different, complementary and equally enjoy- 
able books is a measure of the fascination of 
the topic, complexity of the arguments and 
the fragility of the evidence so far. 

Each author has made a scholarly career in 
the business of asking how we got here, and 
each approaches the question from a different 
perspective. Bipedalism has traditionally been 
seen as the starting point of humankind’s long 
journey from prey to predator, and from A fri- 
ca’s Rift Valley to world domination. But for 
the psychologist Suddendorf it is something 
that can be disposed of in a couple of pages. 
As he tells us, bipedalism freed the hands to 
grip and throw, but it came with serious side 
effects, “including back problems and hem- 
orrhoids”. 

For Gee, gate-keeper of the palaeonto- 
logical papers in this journal, bipedalism is 
just one change among many — one pecu- 
liar posture adopted by a group of animals. 
He notes in The Accidental Species that the 
posture is seen nowhere else, “but one could 
say the same for knuckle walking in chimps 
and gorillas, brachiation in gibbons, and the 
four-handed swing of orangutans”. 

For the evolutionary biologist and barefoot 
runner Lieberman, however, bipedalism was 
a “monumental and consequential” shift. 
The two-legs-good, four-legs-bad effect is 
discussed on at least 40 pages of The Story of 
the Human Body; running, too, gets a good 
show. We are what we are because our bod- 
ies could do what they did. The legs of Homo 
erectus were 10-20% longer than those of the 
hominin Australopithecus, which meant the 
first humans could cover great distances at a 
lower energy cost. But longer legs make arbo- 
real life difficult, so once humans got mov- 
ing, they had to stay on the road. Lieberman 
argues that it is “not just incorrect but also 
dangerous to view modern human evolution 
as solely a triumph of brains over brawn” It 


was the hunter-gatherer physique that got us 
to where we are now. And today’s burgeoning 
human ills — obesity, cardiovascular disease, 
cancer, sleep apnoea, terrible teeth, osteopo- 
rosis and so on — occur because we eat more 
generously than hunter-gatherers but work 
considerably less. 

All three authors mention brain size, but 
Suddendorf points out that, when it comes 
to relative brain size, humans don’t top the 
charts. There are mice and shrews in which 
the brain makes up an extraordinary 10% 
of overall body weight (the human pro- 
portion is 2%). So 


if brain capacity is “Each author 
what makes the dif- asmade 
ference, thereshould 45 cholarly 

be some other scale careerinthe 
that puts humans at business of 

the top. Suddendorf asking how we 
and Andrew Whiten gofhere.” 


have proposed one: an 
excess of absolute brain mass over and above 
that predicted by body size. Meanwhile, Gee 


points out that crows demonstrate a kind of 


calculation and craftiness that humans rec- 


ognize. Because the common ancestor of 


birds and primates lived more than 250 mil- 
lion years ago, this “shoots a huge hole” in 
the idea that modern humans are very clever 
just because previous hominins were quite 
clever, and the ones before them only rela- 
tively so. 

We can't ask the earlier hominins. H. sapiens 
once shared the planet with H. erectus, 
Neanderthals, Denisovans and the little Hob- 
bit of Flores. They have all gone. The last man 
standing must now compose answers on the 
basis of only the slight, capricious and often 
ambiguous evidence that remains. Gee calls 
this “the Beowulf effect”: Old English verse 
survives mainly in just four manuscripts, 
and one of them, which includes Beowulf, 
almost perished in a blaze in 1731. What if it 
had burned, uncopied? What then would we 
know about the song and story of the past? 

Lieberman builds up a picture of vanished 
society by examining humankind as it is now. 
He looks at Tanzanian hunter-gatherers and 
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the Kalahari Bushmen, and works backwards 
to shape an increasingly speculative story of 
how things might have been as foraging, 
scavenging, hunting and resourceful humans 
colonized even the most inhospitable habi- 
tats. Suddendorf is more concerned with the 
things we can learn from other surviving pri- 
mates. Yes, apes cooperate, communicate, use 
tools, share knowledge, solve problems, dem- 
onstrate self-awareness and display emotions. 
But he carefully leaves open the big question: 
how much can you conclude from each case 
study? Gee, meanwhile, gazes into both past 
and future, and sees the idea of evolutionary 
upward mobility (or as Pope had it, “upward 
will he soar, and little less than angel, would 
be more”) a profound misreading of Darwin. 

Both Suddendorf and Lieberman directly 
address the question of natural selection in 
a world in which humans have seemingly 
taken control of nature, and ensured the 
survival of the not-so-fit. Puzzlingly, says 
Suddendorf, “the rich, successful, powerful, 
beautiful, and well-educated people seem to 
breed less, not more, than most of the rest of 
us”. But he suspects humans could find more 
dramatic ways of cutting short their own 
success story, with a little help from war and 
famine. Lieberman would have us get up off 
our chairs, set down our books and chew 
tough fibrous stuff: our bodies may not be 
the best of all possible bodies, but they are 
the only ones we have, and we should look 
after them. 

All three books would make marvellous 
gifts. The Gap is ideal for someone who 
already has a decent collection about human 
evolution. The Story of the Human Body is 
a readable introduction to the whole field 
and great on the making of our physicality. 
The Accidental Species is discursive, rich in 
good stories and terrible jokes, and a salutary 
reminder of how little we know. I shall hang 
on to all three. = 


Tim Radford is a former science editor of 
The Guardian and author of The Address 
Book: Our Place in the Scheme of Things. 
e-mail: radford.tim@gmail.com 
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URBAN STUDIES 


A paved paradise 


Mike Davis explores a vision of car-free, socially networked urban environments. 


appy City opens with a joyful 
Hein rush. Charles Montgom- 
ery is sweating at the pedals ina 

mountain-bike dash through the streets of 
Bogota with the city’s charismatic “Mayor of 
Happiness’, Enrique Pefalosa. “Then he was 
off? Montgomery marvels, “jumping curbs 
... and barking into his cell phone while his 
pin-striped trousers flapped in the breeze.” 

The image of hyperkinetic populist Pefa- 
losa — who was mayor from 1998 to 2001 
— rushing through barrios and urging his 
citizens to stop using cars and organize 
fiestas, may be romanticized. But he did 
divert the highway budget into building 
bike paths, pedestrian plazas and Bogotd’s 
first rapid-transit system. In Montgomery’s 
view, Pefialosa had “redesigned the experi- 
ence of city living for millions of people” 

The example of Bogota provides an 
irresistible introduction to Montgomery's 
thesis that a growing global rebellion against 
private cars, exclusionary zoning and sprawl 
couldbethedawnof 
anew urban world. |Montgomery 
From Milwaukee in focuses on 


Wisconsin to Seoul, the wider 
Montgomery argues, applicability 
the “battle for the ofurban- 
shape and soul of design 

cities ... is finally innovations 
reaching a critical fo generic 
mass’. After decades places.” 


of ferment, neigh- 

bourhood activism is rapidly evolving 
from protest against individual grievances 
to an alternative politics of urban design. 
Montgomery exuberantly lists the changes: 
for example, shopping malls reclaimed as 
‘mini-villages’; towns recast for children’s 
needs; and fenceless neighbourhoods. As 
he puts it, visionary urban designers and 
politicians “are reorganizing the systems 
that hold cities together and rewriting the 
rules that dictate the shapes and functions 
of our buildings”. 

None of this chimes with my own daily 
experience of southern California life as an 
‘ultra-commuter’ (my job is 160 kilome- 
tres from home). Nor does it give me hope 
that some miracle will reverse the United 
States’ national indifference, much of it 
ominously racial in origin, that tolerates 
the death of Detroit, Michigan. Once the 
centre of car manufacture and the Motown 
music industry, Detroit is now a bankrupt 
shell with barely 40% of its 1950 population. 


But Montgomery has no time for pessi- 
mists, writing that if Bogota or Athens can 
be “reconfigured to boost happiness’, these 
principles can heal richer cities around the 
world. 

The ‘happy city’ thesis is a tough one to 
lift off the ground, and early parts of the 
book simply do not fly. For example, Mont- 
gomery sets up a contrast between the good 
city and the bad suburb that fails to sample 
anything like the average reality of metro- 
politan life. To illustrate urban happiness, he 
has a huge home-court advantage: he lives in 
Vancouver, Canada, the most well-planned 
and prosperous big city in North America. 
At the other extreme, he takes a bizarre bus 
trip with home bargain-hunters through a 
foreclosed and semi-abandoned suburb near 
Stockton, California — ground zero of the 
2008 US mortgage apocalypse. 

Most of Montgomery’s critique of the 
emotional emptiness and environmental 
costs of suburban life is familiar socio- 
logical boilerplate that dates back to the 
1950s. He understandably hurries through 
this to arrive at a more interesting sub- 
ject: environmental psychology’s view of 
human responses to the built environment. 
Numerous studies, such as the global World 
Values Survey, show that people are happier 
in densely social, architecturally complex, 
pedestrian-oriented, park-rich and cultur- 
ally stimulating environments. When effi- 
cient alternatives are available, they will even 
cut the umbilical cords to their cars. 

But Montgomery also cites uncomfortable 
research from sources including Harvard 
University in Cambridge, Massachusetts, 
and the University of Zurich in Switzerland. 
This shows that too many North Americans 
and Germans are socially programmed to 
choose status symbols such as supersized 
suburban homes at the end of nowhere over 
a connection with other people or a walk in 
a park. 

None of this is Earth-shaking news. 
Bookshelves groan with glossy catalogues 
of urban invention and ‘smart growth — a 
genre that, too often, is simply a celebration 
of gentrification or an advertisement to move 
to Canada or Scandinavia. What redeems 
Montgomery's book 
from his own miscast 
exhilaration is the sur- 
prising richness and 
critical acuity of his 
case studies. Although 
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cherry-picked from 
such privileged labo- 
ratories as Vancouver 
and Copenhagen, he 
focuses on the wider 
applicability of urban- 
design innovations to 
generic places, even 
the suburban bad- 


Happy City: lands of California and 
Transforming Our "Texas. 
Lives Through are 

: A fascinating exam- 
Urban Design lee th f f 
CHARLES ple 18 the re1orm © 
MONTGOMERY zoning restrictions 
Farrar, Straus and in Vancouver’s older 
Giroux: 2013. ‘streetcar suburbs. 


Most planners define 
the essence of “Vancouverism as its down- 
town peninsula — a utopian mini-Manhat- 
tan with quiet high-rise neighbourhoods and 
almost no traffic congestion. Montgomery, 
however, rightly sees a more revolutionary 
step in the city’s legalization of the conver- 
sion of alley garages into cottages and base- 
ments into apartments. This simple zoning 
reform has enabled “one of the biggest urban 
infill projects on the continent’, allowing 
thousands of residents to find affordable 
housing in inner suburbs. Hopefully, plan- 
ners and housing advocates in Los Angeles 
and other North American cities with short- 
ages of affordable housing will recognize the 
ingenuity of this initiative and the part that 
decent mass transit plays in making it work. 
Montgomery offers other compelling exam- 
ples showing the power of community imag- 
ination, or simply its common sense, when 
unleashed. 

But it would be folly to ignore the spe- 
cial conditions that make the innovation of 
urban happiness possible in Montgomery’s 
case-study cities. Without exception, these 
locations have had strong social-democratic 
or progressive governance. It is hard to imag- 
ine that market forces in sunbelt cities such 
as Phoenix, Arizona, would ever allow such 
communitarian design principles to sink 
deep roots. And, for most of the world’s 
present and future urban population, from 
Lagos in Nigeria to Chengdu in China, the 
conversation first needs to take place at the 
more fundamental level of basic needs and 
human rights. m 


Mike Davis is a writer and urban historian 
based in San Diego, California. 
e-mail: michael.davis@ucr.edu 
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Jad Abumrad 
Airwave trailblazer 


Jad Abumrad co-hosts Radiolab, the science-heavy, nationally syndicated US public-radio show. 
As it enters its tenth year, he takes time out from a 21-city North American tour with co-host 
Robert Krulwich to talk about crafting high-speed science stories on radio. 


How did you get into science and radio? 

I grew up in Tennessee in the 1980s, an 
Arab kid in a Southern Baptist landscape. 
My mother is a molecular biologist and my 
father is a surgeon, and as a kid Id be stuck 
in their labs after school, bored out of my 
skull and playing with the rats. I gravitated 
to music, and as a teen I would stay in my 
room composing scores for imaginary films. 
After studying writing and music in college, 
I started volunteering in public radio and 
eventually secured a late-night slot to air 
forgotten documentaries on WNYC — New 
York Public Radio. Two days before the show 
aired, I asked my boss who the host would 
be. He said, “You.” 


How did the show evolve? 

In the early days, the station mercifully 
treated me with benign neglect. I didn’t know 
what I was doing. Most of the show was old 
material, borrowed and rehashed. Inevitably 
there would be a hole in the programme, and 
in a panic I would throw together a little seg- 
ment on the politics of Zimbabwe or I'd ask 
listeners to scream on my voicemail. In 2003, 
I met science broadcaster Robert Krulwich 


and we experimented with combining radio 
formats such as interviews, documentaries, 
storytelling and music. We settled on a ‘two- 
guys-talking’ format, but with quotes from 
scientists and surreal soundscapes popping 
out like thought bubbles. From the beginning, 
Radiolab had a digital sensibility, a sense of 
speed and density that some have heralded 
as the future of public radio and others have 
complained is too jittery. 


Do you ever run up against the limits of radio? 
In our episode on colour, when talking 
about the retina with a biologist, we asked 
which creature had the most types of colour- 
receptor cone in its eyes. The answer was the 
mantis shrimp, which has 16 cones, each 
receptive to a different wavelength of light. 
Unable to convey this visually, we gathered a 
160-voice choir in a cathedral, divided them 
up according to colours in the spectrum 
and asked them to sing us a rainbow as the 
shrimp would see it. 


What happens at your live show? 
Apocalyptical tells stories of mass destruc- 
tion, focusing on the extinction of dinosaurs 


BOOKS & ARTS 


at the end of the Cretaceous period 65 mil- 
lion years ago. Some moments, in which 
you learn bits of chemistry and physics, or 
the lights dim and we all listen deeply, feel 
like the on-air show. The rest is strange and 
new. There is animation and video to reveal 
the mechanics of violent collapse. There are 
giant dinosaur puppets made by the Aus- 
tralian theatre company Erth. Musicians 
create swells of sound that make you feel 
like the roof is going to fall in. Comedians 
like Reggie Watts and Patton Oswalt will 
make you laugh. We're all moving towards 
our own end, as individuals and asa species, 
but we hope to leave you with a sense of how 
extraordinary it is to be alive. 


What is the gist of the science in the show? 
We present a theory proposed by scien- 
tists Jay Melosh, Peter Schultz, Douglas 
Robertson and Kirk Johnson that draws on 
ballistic simulations to argue that the Creta- 
ceous extinction may have been much faster 
than previously thought. The conventional 
picture is ofa global ‘nuclear winter’ brought 
on by ash thrown up into the atmosphere by 
a meteor that killed off the dinosaurs over 
tens of thousands of years. We argue that 
the extinction may have taken just a single 
afternoon. 


Is it true that you plan to branch out from 
science? 

Yes and no. Scientific ideas can be irreduc- 
ibly complex and wrestling with them can 
be exhausting. But it’s no longer acceptable 
for people not to understand what’s hap- 
pening in science, and I do find science 
endlessly inspiring even when it’s pissing 
me off. So science reporting will always be 
a fundamental part of what we do. But in 
recent months I have thought about getting 
into legal affairs. I wonder sometimes what 
it would be like to cover an election. And 
what about sports? 


Have you got things wrong? 

Absolutely. For a show about stochasticity, 
we wanted to demonstrate how hard it is 
to tell a signal from noise at the molecu- 
lar level. So we ran a tape of a 99-year-old 
woman singing through a noise filter, over 
and over. We felt it was a gorgeous meta- 
phor for how our bodies deal with biologi- 
cal ‘noise; the topic of a reported segment by 
Carl Zimmer. But Carl said that it was not 
even remotely correct. We had an epiphany: 
let's just play the incorrect version and then 
broadcast Carl telling us how wrong it is, 
followed by a good-faith attempt to correct 
ourselves. We've never claimed to know what 
were talking about, which may explain why 
no one has come to stick a pitchfork in our 
foreheads. = 
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Brain tissue affected by Alzheimer’s disease prepared for microscope examination. 


ANTHROPOLOGY OF MEDICINE 


Where we are now 
with Alzheimer’s 


Eus Van Someren welcomes a call to critically evaluate 
progress in research on this form of dementia. 


questionnaire used worldwide to evaluate 

the severity of dementia, an early question 
is “Where are we now?” That is also the pri- 
mary question of Margaret Lock’s The Alzhei- 
mer Conundrum. This anthropological study 
peers back at a century of grappling with Alz- 
heimer’s disease (AD) and surveys the quest 
to understand its mechanisms and the scien- 
tific enterprise needed to unravel them. 

Lock dares to pose big questions, and is well 
prepared to do so. For ten years, she inten- 
sively studied epidemiological and neurobio- 
logical research on AD, went to meetings, and 
interviewed experts, people at risk and carers. 
The result reads like a travel diary, with a bal- 
ance between engagement and objectivity. 
Lock highlights how scientific progress can be 
both accelerated and impeded by paradigms, 
the entangled interests of the pharmaceutical 
industry, the media and personal beliefs. She 
covers the history of AD, prominent views on 
its cause, risk factors such as mild cognitive 
impairment, and biomarkers. 

Along the way, Lock dissects three fun- 
damental tensions. The first is between the 
‘localization approach that attributes demen- 
tia to specific neuropathological changes, and 
the ‘entanglement approach that includes 
complex interactions between mind, envi- 
ronment, ageing and life events. The second 
is between the view that AD is intrinsic to age- 
ing, and the understanding that it is a distinct 
pathology. The third is between deterministic 
genomics, and the view that adds epigenetics. 


le the Mini Mental State Examination, a 


The AD trail that 
Lock follows began in 
Germany more than a 

| €3 century ago. The dis- 
‘ covery of the disease 

by neuropathologist 

| Alois Alzheimer in 
=~ 1906, and its nam- 
 —7 ing by his colleague, 
The Alzheimer the psychiatrist Emil 
Conundrum: Kraepelin, rested on 
Entanglements a new technique for 
of Dementia and which neurologists 
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Aging : : 
BY MARGARET LOCK ns eas and 
Princeton University antl ABO Oy: 
Press: 2013. Cajal received a Nobel 


prize. Their staining 
techniques for brain tissue made the invis- 
ible visible and revealed the amyloid plaques 
and neurofibrillary tangles that have come 
to be regarded as key players in dementia. 
After decades of dormancy, AD research 
reappeared on stage in 1966. Psychiatrist 
Martin Roth and his colleagues reintro- 
duced Alzheimer’s idea, describing autop- 
sies of patients diagnosed with dementia 
that showed not arteriosclerosis — then the 
primary suspect — but numerous plaques 
and tangles. In 1992, neuroscientists John 
Hardy and Gerald Higgins proposed that 


NEW ANGLES ON THE BRAIN 


A Nature special issue 
www.nature.com/neuroscience2013 
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the aggregation of amyloid interferes with 
neuronal signalling and causes dementia. 
Since they formulated their amyloid-cascade 
hypothesis, this has been the dominant view 
on the cause of AD. 

The need to understand the mechanisms 
of the disease, and find targets for prevent- 
ing it, is becoming urgent. With the greying 
of society, it is predicted that some 66 mil- 
lion people worldwide will have dementia 
in 2030. Funded by governments and the 
pharmaceutical industry, studies are being 
undertaken that are unprecedented in 
terms of sample size, follow-up and com- 
prehensiveness. By tracking genetics, brain 
structure and cerebrospinal fluid, almost 
all the studies aim to localize the pathways 
and mechanisms proposed by the amyloid- 
cascade hypothesis. Most assume that this 
research roller coaster will reveal what 
causes AD. But will it? 

Lock invites readers to look at alterna- 
tives to our current path. She delivers key 
concepts in epidemiology, neuroscience and 
genetics in a way that is both scholarly and 
free of unnecessary technical details. Lock’s 
bird's-eye view and mix of diverging sources 
of information is refreshing. For example, she 
writes of the astonishing reluctance of indi- 
viduals screened for biomarkers to change 
their convictions about their risk of develop- 
ing AD. Just pages later, she recounts the very 
similar reluctance of scientists to change their 
convictions about the mechanism by which 
the condition develops — even when faced 
with findings that contradict the amyloid- 
cascade hypothesis. 

Lock calls for a paradigm shift. She hopes 
that careful investigation of lifestyle and 
environmental exposure, and their effects 
on gene expression, will reveal opportunities 
for intervention. But if we find them, will we 
be able to implement them? Our ability to 
change unhealthy behaviours lags far behind 
our recognition of them as unhealthy, and 
we have only begun to touch on ways to help 
people take pleasure in healthy lifestyles. 
Moreover, restrictive measures on the mar- 
keting of temptations may become a major 
issue for industry and economies. Lock 
invites us to face such challenges and take 
responsibility for implementing preventive 
measures worldwide, irrespective of national 
and personal incomes. For its wide scope and 
balanced critical evaluation, The Alzheimer 
Conundrum isan inspiring read for everyone 
working in the field. m 


Eus Van Someren is a professor of 
neurophysiology, founder of 
sleepregistry.org and head of the Sleep & 
Cognition department of the Netherlands 
Institute for Neuroscience of the Royal 
Netherlands Academy of Arts and Sciences, 
Amsterdam, the Netherlands. 

e-mail: e.van.someren@nin.knaw.nl 
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Correspondence 


Discard Soviet 
doctorate systems 


The research performance of 
most former Soviet states is still 
low compared with that of many 
Western countries. In my view, 
progress will be stifled as long 

as the archaic academic systems 
passed down from the former 
Soviet Union persist. 

Ihave analysed the research 
performance of several of these 
nations on the basis of their 
per capita gross expenditure 
on research and development 
(GERD; see go.nature. 
com/8oclnj) from 1998 to 2010 
and their number of research 
publications from 1998 to 2012 
(see www.scimagojr.com). 

The Baltic countries — Estonia, 
Lithuania and Latvia — were 
notably more productive in 2012 
than Russia, Ukraine and Belarus 
(see ‘Research performance in 
former Soviet nations; left), with 
Estonia's publication output per 
1,000 people approaching that of 
the United States (1.7 in 2012). 
Also, the research performance 
of the Baltic states has progressed 
markedly since 1998 (see figure, 
right). 

The figure also shows that 
each country’s publication 
output seems to have improved 
as research funding has 
increased. Georgia, however, had 
no increase in GERD between 
1998 and 2005 (no more recent 
data are available), yet managed 
to more than triple its number of 
research publications between 
1998 and 2012. Russia and 
Belarus have given a particularly 
poor return on investment in 
terms of research publications. 

The success of Georgia and the 
Baltic states since 1998 may be 
connected with their adoption 
of internationally recognized 
standards for doctorate degrees 
and academic promotion. 

These countries now use a 
Western PhD model dedicated 
to original research, and 

career advancement depends 
on producing peer-reviewed. 
publications — which is not the 
case in Russia, Ukraine, Belarus 


and other former Soviet states. 

Those poorly performing 
countries continue to use the 
old Soviet split-level doctorate 
degree. This degree can take more 
than 20 years to complete and 
involves only a minimal initial 
training in research, which may 
not be original and is not subject 
to independent peer review. The 
rest of a student's time is taken up 
with administrative tasks — for 
example, preparing reports for 
committees and government 
departments. 

The poor research 
performance in most former 
Soviet states stems partly from 
weak funding management (data 
not shown). But there is also an 
urgent need for fundamental 
academic reform in those 
countries. 

Alexander Gorobets Sevastopol, 
Ukraine. 
alex-gorobets@mail.ru 


Ranking Brazilian 
research output 


As presidents of the 

Brazilian Association for the 
Advancement of Science and 
the Brazilian Academy of 
Sciences, we object to your 
negative perspective on CAPES, 
our education ministry's 
agency for the Coordination 
for the Improvement of Higher 
Education Personnel (Nature 
500, 510-511; 2013). 

The impact factor is just one 
ofa long list of indicators used 
by CAPES since 1976 to rate 
programmes in 48 fields of 
research, from social studies to 
physics. It is important to stress 
that the agency uses a system of 
peer review to evaluate and rank 
Brazil's graduate programmes — 
not to assess individual curricula 
vitae. The country’s scientific 
community analyses data on the 
performance of each programme 
over the previous three years. 

Other indicators that CAPES 
evaluates include recognition 
of faculty members’ research by 
the international community; 
the coherence, consistency 


RESEARCH PERFORMANCE IN FORMER SOVIET NATIONS 


Only four of the countries sampled have significantly improved their publication 


output in more than a decade. 


Estonia 
Lithuania 
Latvia 
Russia 
Ukraine 
Belarus 


Georgia 
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and comprehensiveness of the 
curriculum and infrastructure for 
teaching, research and continuing 
education; students’ participation 
in research; and the contribution 
of faculty members to the 
supervision of doctoral students. 
There are several ranking 
systems worldwide that also 
incorporate journal impact 
factors and citations into their 
metrics and are acknowledged 
by the scientific community. 
One is the Academic Ranking of 
World Universities (also known 
as the Shanghai Ranking), which 
inspired the Ranking Web (or 
Webometrics) for universities 
around the world. 
Helena B. Nader Brazilian 
Association for the Advancement 
of Science, Sao Paulo, Brazil. 
hbnader.bioq@epm. br 
Jacob Palis Brazilian Academy of 
Sciences, Rio de Janeiro, Brazil. 


Russia embraced 
Wallace’s works 


On the centenary of Alfred 
Russel Wallace's death this 
week, it is worth recalling the 
remarkable influence that the 
naturalist’s ideas had in Russia 
during his lifetime. His major 
works were translated into 
Russian, including his books 
The Malay Archipelago (1869), 
Darwinism (1889) and Man’ 
Place in the Universe (1903). 
Although Wallace was most 
famous as Charles Darwin's 
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co-discoverer of evolution by 
natural selection, the demand 
among Russian intellectuals 
was mainly for Wallace’s own 
publications. Many warmed to 
Wallace’s contention that human 
spiritual faculties cannot be 
explained by natural selection. 
Theologians such as Alexander 
Gusev (1842-1904) used it to 
defend Orthodox Christianity 
against ‘non-believers. Gusev 
even accused the translators of 
Wallace's Contributions to the 
Theory of Natural Selection (1870) 
of censoring ‘unscientific ideas 
and of distorting Wallace's holistic 
view of science and religion 
(G. S. Levit and S. V. Polatayko 
Theor. Biosci. http://doi.org/pq8). 
Russia’s fascination with 
Wallace's work helped to 
shape the debate among early 
evolutionists on alternative 
versions of Darwinism (see 
D. Todes Nature 462, 36-37; 
2009) and opened up discussion 
on the uniqueness of the human 
soul. 
Georgy S. Levit Friedrich Schiller 
University, Jena, Germany, 
University of King’s College, 
Halifax, Nova Scotia, Canada, 
and National Research University 
ITMO, St Petersburg, Russia. 
Uwe Hossfeld Friedrich Schiller 
University, Jena, Germany, and 
National Research University 
ITMO, St Petersburg, Russia. 
Lennart Olsson Friedrich 
Schiller University, Jena, 
Germany. 
lennart.olsson@uni-jena.de 


7 NOVEMBER 2013 | VOL 503 | NATURE | 39 
© 2013 Macmillan Publishers Limited. All rights reserved 


UNESCO INSTITUTE FOR STATISTICS/SCIMAGO JOURNAL & COUNTRY RANK 


OBITUARY 


Harold Melvin Agnew 


(1921-2013) 


Physicist and Manhattan Project veteran. 


arold Melvin Agnew, one of the 
H last surviving members of the team 

that began the nuclear age, died 
on 29 September. Equipped with only an 
undergraduate degree, Agnew helped to set 
in motion the first controlled, self-sustain- 
ing nuclear chain reaction, worked on the 
atomic bomb, and witnessed the bombing of 
Hiroshima in Japan from inside an aircraft 
that was part of the strike operation. 

A native of Colorado, Agnew studied 
chemistry at the University of Denver. 
There, he pitched on a championship soft- 
ball team and earned a scholarship to pursue 
graduate studies at Yale University in New 
Haven, Connecticut. Early in 1942, Agnew 
accepted a position as a research assistant at 
the Metallurgical Laboratory at the Univer- 
sity of Chicago in Illinois, where the world’s 
first nuclear reactor was taking shape under 
the stands ofan athletic field. On 2 Decem- 
ber 1942, he witnessed the first controlled 
nuclear reaction. 

Agnew’s next assignment, beginning in 
early 1943, was to work with a small team 
to disassemble a particle accelerator at the 
University of Illinois and to arrange for its 
transportation to a facility in Los Alamos, 
New Mexico, the new home of the Manhat- 
tan Project to build the first atomic bomb. 
The team included his wife, Beverly, whom 
he had met at high school; while Agnew 
worked on the accelerator, Beverly signed 
on as a secretary and assistant involved in 
the management of the new facility. 

As work on the atomic bomb neared 
completion, the physicist Luis Alvarez (who 
joined Los Alamos in 1944) assembled a 
team that would accompany the weapon 
into combat. Agnew, who stated in a later 
interview that he had “wanted to get in the 
war’, was among the first volunteers. 

Starting in late 1944, the team spent six 
months developing and testing the instru- 
mentation and procedures that would enable 
them to determine from an aeroplane the 
power ofa nuclear explosion. Rather than try- 
ing to measure the level of radiation released 
after the bomb had exploded, the Alvarez 
group decided to use a microphone fitted with 
circuitry that would translate the shock wave 
of the blast into electrical signals that could be 
recorded on an aircraft’s gun camera. 

Agnew and the rest of the team began 
their journey to war at Wendover Air Force 
Base in Utah. Outfitted with uniforms and 
taught how to salute, the scientists were 


flown to Tinian in the Northern Mariana 
Islands, the base from which B-29 aircraft 
were being sent to burn Japanese cities to the 
ground. 

A B-29 called Enola Gay lifted off the 
Tinian runway for Hiroshima at 2.45 a.m. 
on 6 August 1945, with the uranium bomb 
aboard. A second B-29, The Great Artiste, 
took off two minutes later. As well as a crew 
of ten men, the second aeroplane carried 
three scientific observers: Alvarez, Lawrence 
Johnson and Agnew. At 8.15 a.m., bombar- 
dier Kermit Beahan opened the bay doors 
and dropped three parachute-equipped blast 
gauges designed to transmit the magnitude 
and duration of the bomb’s blast wave to 
a receiver on the plane. Each of the three 
scientists manned a receiver tuned to the 
frequency of one of the detecting probes. 
When the bomb was dropped, Alvarez could 
not locate his frequency, and Agnew and 
Johnson provided all the data on the shock 
waves reverberating from the explosion. 

With his primary task accomplished, 
Agnew pulled out his personal 16-milli- 
metre motion-picture camera and filmed 
the mushroom cloud. A plane trailing five 
kilometres behind The Great Artiste was 
equipped with a high-speed still camera, 
but Agnew’s ‘unofficial’ film was the only 
motion-picture footage of the Hiroshima 
explosion. Three days later, Agnew equipped 
the tail gunners of the aeroplanes on the 
Nagasaki mission with film cameras, thereby 
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ensuring a motion-picture record of the 
second atomic strike. 

Agnew ended up giving his footage to 
the Hoover Institution on War, Revolution 
and Peace at Stanford University in Califor- 
nia. “I should have held them until today,’ 
he remarked later, “sold them on eBay and 
become a millionaire”. 

With the war over, Agnew earned a PhD 
in 1949 working under nuclear physicist 
Enrico Fermi at the University of Chicago. 
Returning to Los Alamos, he then joined a 
project to develop hydrogen and thermo- 
nuclear weapons, served as project manager 
for a bomb test on Bikini Atoll in 1954 and 
became director of the Los Alamos National 
Laboratory in 1970. 

Retiring from the post in 1979, Agnew 
was president and chief executive of 
General Atomics, a firm developing inno- 
vative nuclear reactors, headquartered in 
San Diego, California, until 1985. He served 
as scientific adviser to the Supreme Allied 
Commander Europe (one of NATO’s two 
highest-ranking military commanders) 
in the early 1960s and as a White House 
science councillor for much of the 1980s. 
He also chaired and served on several 
military advisory boards, was a member of 
the US National Academy of Sciences and 
the National Academy of Engineering, and 
received various awards. 

A scientific ‘cold warrior, Agnew never 
expressed regret about his involvement in 
the Manhattan Project. “My feeling towards 
Hiroshima and the Japanese was, they 
bloody well deserved it,” he remarked in 
1984. He favoured the use of tactical nuclear 
weapons in Vietnam, and argued success- 
fully against a comprehensive nuclear-test- 
ban treaty, although he did once say that he 
would require every world leader to witness 
an atomic blast while standing in his under- 
wear, “so he feels the heat and understands 
just what he's screwing around with” 

Towards the end ofhis life, Agnew summed 
up the achievements of his generation with 
pride. “We brought a quick end to a devas- 
tating war and maintained the peace and 
eventually saw democracy prevail. That’s 
something you can hang your hat on” = 


Tom Crouch is senior curator of 
aeronautics at the Smithsonian Institution's 
National Air and Space Museum in 
Washington DC. 

e-mail: croucht@si.edu 
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Sculpting neuronal connectivity 


It emerges that a transcription program differentially regulates inhibitory inputs in distinct neuronal compartments 
— an unexpected coordinated switch for achieving experience-dependent ‘plasticity’ in neural circuits. SEE LETTER P.121 


EMILY SYLWESTRAK & PETER SCHEIFFELE 


xperiences trigger long-lasting changes 
E: memory and behaviour. Although 

manifested at the organismal level, such 
changes arise from modifications of individ- 
ual synaptic connections between neurons. 
Specifically, new sensory experiences alter a 
neuron’ activity (its output of action poten- 
tials) by modulating the strength of the synap- 
tic connections that it receives’. On page 121 of 
this issue, Bloodgood et al.” demonstrate that 
a transcriptional program drives highly local- 
ized changes in synaptic connections between 
neurons. The authors’ results thus offer a 
refined molecular explanation for experience- 
dependent learning processes. 

When an animal explores an uncharted 
environment, neural activity originating from 
its sensory organs propagates into the brain, 
where neurons receive a barrage of concur- 
rent excitatory and inhibitory inputs. The net 
activity signal that each neuron receives is 
determined by a delicate balance of these two 
competing forces. In some neurons, excita- 
tion prevails and the cells produce an action- 
potential spike; in others, inhibition overrides 
excitation and the cells remain silent. 

The cellular location of an inhibitory input 
— that is, whether it impinges on a neuron’s 
cell body (soma) or dendritic projections — 
as well as its strength govern how effectively 
excitatory inputs sum up. Somatic inhibitory 
synapses, which lie close to the spike-initiation 
machinery, can effectively cancel excitation’. 
Farther away along the dendrite, an inhibitory 
input can merely dampen an excitatory signal. 
Separate regulation of somatic and dendritic 
synapses allows a neuron to both ‘gate’ its spik- 
ing activity and control the dendritic integra- 
tion of excitatory inputs. Nonetheless, little is 
known about the molecular mechanisms that 
selectively regulate these inhibitory inputs at a 
subcellular level. 

A leading hypothesis has been that neuronal 
activity triggers specific gene-expression pro- 
grams downstream, with the resulting gene 
products modulating transmission at syn- 
apses’. Because these gene products would 
be synthesized in the soma, they have been 
implicated in rather non-selective, cell-wide 
changes in total synapse number or strength’. 
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Figure 1 | Subcellular-specific regulation of neuronal activity. When an animal explores a new 
environment, its hippocampal neurons receive excitatory inputs conveying spatial information. This 
excitation is balanced by inhibitory inputs that impinge on the cells’ soma and dendrites. Bloodgood 

et al.’ show that increases in neuronal activity drive an NPAS4-dependent transcriptional program, a 
key target of which is the gene encoding BDNF. The latter protein selectively stimulates the formation of 
somatic inhibitory synapses and, through parallel, unknown pathways, reduces the number of dendritic 
inhibitory synapses. Thus, the NPAS4 transcriptional program independently controls at least two 


distinct classes of inhibitory synapse. 


The transcription factor NPAS4 is one such 
activity-regulated gene product, which itself 
initiates a gene-expression program that mod- 
ulates the total number of inhibitory synapses 
in vitro’. But whether activity-regulated gene 
products can act on a spatially restricted set 
of synapses to provide more-refined control of 
neuronal function has remained unclear. 

To address this question, Bloodgood and 
colleagues housed mice either in standard 
cages or in enriched environments containing 
running wheels and novel objects to stimulate 
activity in the hippocampus — a brain struc- 
ture that is crucial for spatial navigation and 
memory. Exposure to the enriched environ- 
ment resulted in a marked increase in NPAS4 
levels in hippocampal neurons. Moreover, 
this transcription factor changed the number 
of inhibitory synaptic connections made with 
pyramidal cells, the main excitatory cell type 
in the hippocampus. Most notably, NPAS4 
triggered an increase in somatic inhibitory 
synapses, with a simultaneous reduction in 
dendritic inhibitory synapses (Fig. 1). 

NPAS4 can control the expression of many 
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genes, one or more of which might mediate the 
observed differential effects on the number of 
somatic versus dendritic inhibitory synapses. 
To identify the genes responsible, Bloodgood 
et al. performed a comprehensive screen 
involving three search criteria: for genes that 
are directly regulated by NPAS4; those that are 
strongly upregulated by neuronal activity; and 
those that control the number of inhibitory 
synapses. The gene encoding brain-derived 
neurotrophic factor (BDNF) satisfied all three 
requirements. (Intriguingly, BDNF has also 
been shown’”* to be regulated independently 
in different cellular compartments.) 

The authors’ further analyses confirmed that 
NPAS4 promotes BDNF production during 
bouts of increased neuronal activity. Further- 
more, using mice lacking BDNF, they showed 
that this factor is necessary for the increase in 
the number of somatic inhibitory synapses. 
However, the reduction in dendritic-synapse 
number persisted in these mutants, indicat- 
ing that different NPAS4 target genes regu- 
late this aspect of NPAS4-dependent synapse 
remodelling. 


ANTOINE BRICARD/NICOLAS DESREUMAUX/DENIS BARTOLO 


Bloodgood and colleagues’ characterization 
of the NPAS4-mediated control of specific sets 
of inhibitory synapses is a significant step for- 
ward in our understanding of how activity can 
exert long-lasting changes in synapse number. 
Nonetheless, some questions about the impli- 
cations of these findings remain. 

The hippocampus contains several types 
of inhibitory interneuron cell, and these vary 
in their activity, synaptic function and sub- 
cellular targeting of pyramidal cells’. Whereas 
Bloodgood and co-workers’ experimental 
set-up was designed to separate somatic and 
dendritic inputs through spatially restricted 
stimulation, it would be beneficial to extend 
this study to investigate the specific cell types 
at play. Various mouse lines that allow distinct 
interneuron subpopulations to be accessed” 
could be useful for this purpose. Knowing pre- 
cisely which cell groups are involved will be 
instructive in further refinement of hypotheses 
for how activity-induced changes in inhibition 
affect hippocampal processing and behaviour. 

Another question is how the effects of 
BDNE become confined to specific subcellular 
compartments. Neurons transcribe several dif- 
ferent messenger RNAs of BDNE and it could 
be that activity-induced BDNF mRNAs are 
preferentially trafficked to somatic synapses, 
where they are translated locally. Another pos- 
sibility is that newly synthesized BDNF protein 
is selectively released near somatic synapses. 
Alternatively, specificity might arise from the 
axonal processes of other neurons that make 
contact, such that inhibitory axons targeting 
the soma are more responsive to BDNF than 
their dendrite-targeting counterparts. 

More broadly, the ability of NPAS4 to 
coordinately and bi-directionally modulate 
somatic versus dendritic inhibition has impli- 
cations for synaptic plasticity (the processes 
by which synapses grow stronger or weaker 
depending on their activity level), which 
relates to learning. To induce plasticity at excit- 
atory synapses, the net excitatory-inhibitory 
input must trigger the entry of calcium ions 
into the dendrite’’. Nearby dendritic inhibi- 
tion normally dampens the efficacy of synap- 
tic input. NPAS4 activation would relieve this 
inhibition, making individual synapses more 
likely to undergo the lasting changes associ- 
ated with learning and memory. At the same 
time, increased somatic inhibition would 
tighten the time window in which combined 
synaptic input can produce a spike. The con- 
sequences of both actions are that fewer excita- 
tory synapses would be needed to cause the cell 
to spike, but that those inputs must be more 
temporally coincident. 

Bloodgood and colleagues’ data unify sev- 
eral previous observations concerning the 
activity dependence and synapse-forming 
potential of NPAS4 and BDNE. Moreover, 
they characterize synapse-specific rules gov- 
erning the effects of NPAS4 activation, and 
thereby highlight an extra layer of adaptability 


of neural circuits and a mechanism by which 
sensory input can produce long-lasting 
changes in synaptic connectivity. m 
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Populations of rolling particles have been shown to display unidirectional 
collective motion in a racetrack enclosure. Theoretical modelling suggests that 
hydrodynamic and electrostatic effects promote such behaviour. SEE LETTER P.95 


MICHAEL SHELLEY 


physicist Georg Quincke demonstrated 
that, when suspended in an electrically 
conducting fluid, an insulating sphere will 
rotate if a sufficiently strong electric field 
is applied’. On page 95 of this issue, Bricard 
et al.” use this little-known effect — now called 
Quincke rotation — as the energy-transfer 
mechanism, or motor, to drive populations 
of millions of microspheres to spontaneously 
form rolling herds (Fig. 1). In a racetrack- 
shaped enclosure, these ‘Quincke roller herds 
merge into a unidirectional swarm that circles 
the track (see Fig. 2 of the paper), and increase 
in size with increasing roller concentration. 
The authors interpret and rationalize their 
results using mathematical modelling to 
show that, at short distances, Quincke rollers 
interact mainly through two pairwise effects: 
a hydrodynamic interaction that promotes 
‘polar’ alignment of rolling directions (that is, 
the movement of rollers in the same direction) 
and a repulsive electrostatic interaction. 
Herds of Quincke rollers are a new example 
of active matter. This is the subject of a field 
spanning many disciplines that originated 
through researchers’ desire to understand 
self-organized structures in biology, such 
as bird flocks and bacterial swarms, and the 
cellular cytoskeleton’. Although active matter 
lacks a simple definition, we tend to think of 
active-matter systems as those composed of 
many, possibly identical, interacting particles 
that each use a local energy source to execute a 
change in shape, orientation or position. Roller 
herds are akin to microswimmer suspensions, 
such as bacterial baths*, whose members also 


E a study published in 1896, the German 


interact with each other hydrodynamically. 
But, unlike such free swimmers, Quincke 
rollers move through an applied net torque, 
which results in a very different hydrodynamic 
coupling between them. 

Other synthetic active-matter systems have 
been devised and powered by chemical reac- 
tions’, mechanical vibration’®, hydrolysis of 
the cellular energy molecule ATP’, and light’, 
magnetic’ and electric fields (as for Quincke 
rollers). The interactions between the constitu- 
ent particles of an active-matter system can 
be mediated by several effects: induced flows 
in the surrounding fluid; spatially distributed 
fields such as a chemical concentration or an 
electric field; direct mechanical coupling; 
and collisions. The set of observed collective 


Figure 1 | Roller herds. Bricard et al.” show how 
populations of rolling microspheres confined to a 
track spontaneously form moving herds. Particle 
size is 5 micrometres; the length of the track is 
31.5 millimetres. 
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behaviours is also large — the formation of 
vortices, jets, aggregates, crystals, asters, 
swarms, strange suspension rheologies, defect 
dynamics and persistent turbulent-like motions. 
Active matter is obviously rich in phenomena. 

The strengths of the present study include 
the novelty of the system’s energy-transfer 
mechanism, the relative simplicity of the sys- 
tem’s behaviours (at least under the geometric 
constraints of the racetrack) and the appar- 
ent completeness of the explanatory theory. 
Another strength is that the theory can be 
related to phenomenological models of flock- 
ing’, the parameters of which are generally 
unmoored from specific physics or behav- 
ioural responses. The authors’ experimental 
videos of Quincke roller herds forming, travel- 
ling and merging are well worth watching (see 
go.nature.com/tqlqsp); especially striking is 
the footage showing the rollers’ decidedly 
un-herd-like behaviour when confined to a 
square enclosure. 

I do have two slight quibbles. One of the 
roller swarm’s features is its statistical uni- 
formity, or lack oflarge density fluctuations, at 
high concentrations (see Fig. 4c of the paper’). 
The authors say that large density fluctuations 
have, until now, been considered a hallmark 
of active-matter systems such as theirs. This 
seems a straw-man argument; although large 
density fluctuations are certainly a feature of 
many such systems, their appearance is more 
a matter of scientific interest than of definition. 
A counterpoint example is suspensions of self- 
propelled ‘puller’ particles, which are powered 
from the front. Theoretical studies have shown 
that such puller suspensions, unlike ‘pushers’ 
and despite hydrodynamic interactions, also 
maintain near-statistical uniformity, although 
they show no propensity at all towards swim- 
mer alignment"! (the generously minded 
might consider this a peculiar form of collec- 
tive behaviour). In addition, although Bricard 
et al. put their nonlinear theory to good use 
in calculating the transitions and stability of 
the theory’s steady states, I find ita little disap- 
pointing when studies do not take the extra 
step of simulating the full dynamics and seek- 
ing greater exposure of a theory to experimen- 
tal observations. 

What might further studies of Quincke 
rollers or related systems explore? The present 
study shows that the geometry of the enclo- 
sure is one of the main determinants of how a 
Quincke-roller suspension behaves. Given that 
a single continuous travelling swarm appears at 
high concentrations in the racetrack geometry, 
it would be interesting to see what dynamics 
emerge in a figure-of-eight enclosure in which 
collisions seem inevitable. Because the authors 
conjecture that active-matter systems could 
have applications in understanding social 
phenomena, let me mention one. Pedestri- 
ans in New York show both local alignment 
and repulsion while moving around the city, 
and when two opposing masses intersect at 


a crossing, I have observed the spontaneous 
formation of interwoven lanes that facilitated 
the (mostly) collision-free and efficient dis- 
placement of New Yorkers from one side of 
the street to the other. I suspect that tourist 
masses, often less accustomed to dense city 
life, move around the city in a less orderly way. 
I seriously wonder whether a related physical 
system — perhaps one inducing apolar align- 
ment — might be concocted that would repro- 
duce this observation. I wonder, too, whether 
Quincke rollers might be constrained along a 
wall, perhaps by placing them in depressions, 
so as to make a microfluidic pump. 

Lastly, I believe that a true understanding 
of flocking by birds and fish remains out of 
the reach of small-scale systems such as the 
Quincke rollers described here. This is because 
we still lack an understanding of how large, iner- 
tially dominated, swimming or flying organ- 
isms interact with each other, constructively 
or destructively, through their vortical fields. = 
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Novelty makes the 
heart grow fonder 


Research on guppies provides the most definitive evidence yet for the rare-male 
effect — a long-standing hypothesis to explain the perplexing maintenance of 
variation in traits that are subject to strong mate choice. SEE LETTER P.108 


JEFFREY S. MCKINNON 
& MARIA R. SERVEDIO 


he little fish commonly known as guppies 

are almost as popular among evolution- 

ary biologists as they are among home 
aquarists, because they provide a tantalizing 
opportunity to explore how genetic variation 
is maintained in natural populations. The 
males of this species (Poecilia reticulata) pres- 
ent a dazzling display of colour variation, and 
it is known that female guppies show strong 
mating preferences on the basis of male colour 
patterns. But the maintenance of this colour 
variation presents an evolutionary mystery, 
because sexual selection for a specific form 
of a trait should erode variation in that trait’. 
The discovery” that, in a natural setting, male 
guppies with rare colour patterns survive 
better than more common types, presumably 
because they are less often targeted by preda- 
tors, was an important advance in explaining 
guppy colour diversity. Now, on page 108 of 
this issue, members of the same research 
group (Hughes et al.*) show that wild guppy 
males with rare traits obtain more matings and 
leave more offspring. Thus, it seems that the 
diverse coloration in this species is promoted 
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by both natural and sexual selection. 

Negative frequency-dependent selection — 
which in the case of sexual selection is some- 
times called the rare-male effect — is a process 
in which the evolutionary fitness of a trait 
goes up as its relative abundance goes down. 
The literature on this effect goes back a long 
way and is substantial, but has at times been 
heavily criticized**. Some of the best avail- 
able evidence has come from investigations of 
guppies in the laboratory, in which prefer- 
ences for both unfamiliar and rare traits (phe- 
notypes) have been documented®”. But the 
relevance of such laboratory results to natural 
populations has been open to question because 
ofa lack of studies in natural settings’*. 

Hughes and colleagues’ study is noteworthy 
in large part because it involves naturally 
occurring male colour phenotypes in natural 
pools located along streams in Trinidad. To 
perturb these stream systems from a possible 
evolutionary equilibrium in the abundance of 
different colour patterns, and to enhance the 
power to detect negative frequency-depend- 
ent selection, the authors experimentally 
adjusted the frequencies of different types of 
males (Fig. 1). For balance, they made the same 
phenotypes rare in some pools and common 


Experimental 
generation 


Offspring 


Figure 1 | The rare-male effect in a natural population. Hughes et al.’ adjusted the frequency of male 
guppies in stream pools such that males with coloured tails were common and males with uncoloured 
tails were uncommon (a), or the reverse (b). The frequency of females was kept constant. In both types 
of pool, the authors found that the rare males obtained more matings with females and, as a result, 
contributed proportionally more offspring to the next generation. 


in others. Then, to evaluate the reproductive 
success of the males, they undertook the 
daunting task of genotyping more than 
1,400 offspring of pregnant females collected 
from the pools at the end of the experiment. 
Working across three stream systems and with 
an impressive level of replication, the research- 
ers observed a strong pattern of elevated 
mating and reproductive success for whichever 
male type was rare. 

Although this is an important study, it is 
not without limitations. One potential issue 
stems from the fact that, in guppies, fertiliza- 
tion is internal, and females can store sperm 
for weeks or months. When Hughes et al. col- 
lected females from their experimental pools 
and allowed the fish to give birth in the lab, 
they found strong evidence for a rare-male 
effect among the fathers of the first round of 
offspring. But when the females were obliged to 
use stored sperm for a second brood, the rare- 
male effect among the fathers of those offspring 
was not statistically significant. The authors 
argue that, in nature, female guppies mate fre- 
quently and rarely rely on stored sperm, so that 
all natural broods are essentially equivalent to 
the first broods in their experiment. However, 
the data reviewed by Hughes and colleagues 
in support of this claim are not extensive, and 
more work on this point would be helpful. 

There was also a technical limitation in the 
genotyping technique used by the authors: 
relying on variable genomic regions known 
as microsatellites, they could confidently 
assign paternity to only a minority of indi- 
vidual offspring. Although there is no 
reason to expect this to create any bias in the 
data, it might be worthwhile revisiting the 
samples as more powerful methods for assign- 
ing paternity become available. 

Hughes and colleagues’ evidence for the 


rare-male effect in nature raises intriguing 
questions. The most obvious is, why do female 
guppies prefer males with uncommon colour 
patterns? Here we have more hypotheses than 
data sets. One theoretical analysis suggests 
that greater survival of rare male types, which 
has been found in guppies’, might contribute 
to the evolution of such a preference, even if 
it is costly”. Interestingly, the same analysis 
points out that the negative feedback inherent 
in sexual selection for rarity (such selection 
causes rare types to become common) can ulti- 
mately prevent preferences for rare males from 
becoming ubiquitous. It remains to be deter- 
mined whether preference for rarity varies 
among guppy females in experimental popu- 
lations such as those used in this study. Hughes 
et al. point out that the rare-male effect might 
also emerge if females avoid mating with males 
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that share the colour patterns of their previous 
mates, to increase the genetic diversity of their 
offspring. Alternatively, preferences for rarity 
could have evolved because they lead to lower 
rates of inbreeding if populations are small and 
intermittently isolated. It has even been sug- 
gested” that mating preferences for rare types 
provide no benefit for females and may be a 
manifestation of a preference for novelty that 
has evolved in other contexts. 

The clever manipulations of natural popula- 
tions used by Hughes et al. may, as the authors 
suggest, be a promising avenue for exploring 
the maintenance of variation by frequency- 
dependent selection in other systems. It might 
well be the case that, when put to the test, pref- 
erences for rarity might themselves not prove 
to be so rare. m 
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What to do and how 


Decisions can differ depending on the context that surrounds them. Analyses of 
the prefrontal cortex region of the monkey brain indicate that a dynamical process 
at the neuronal population level controls this behaviour. SEE ARTICLE P.78 


JEFFREY C. ERLICH & CARLOS D. BRODY 


eurons typically receive inputs from 
many thousands of others. This dense 
connectivity suggests that brain func- 
tions such as perception, cognition and motor 
control result from the concerted activity of 
populations of neurons, rather than from single- 
cell activity. Nevertheless, neuroscientists have 


long made discoveries by recording from, and 
thinking about, one neuron at a time. But more- 
recent developments in electrophysiological 
recording' and imaging’ of the dynamics of 
hundreds of neurons simultaneously, along- 
side advances in data-analytic and theoretical 
tools*, highlight the importance of understand- 
ing the relationship between the dynamics of 
large neural populations and the computations 
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those dynamics embody*"". In a tour de force 
reported on page 78 of this issue, Mante et al.” 
use electrophysiology together with sophisti- 
cated behavioural and neural modelling to add 
a population-dynamics view of the computa- 
tions underlying flexible, context-dependent 
behaviour. 

Howyou respond to a glance up a busy street 
should be very different depending on whether 
you intend to cross the road or hail a taxi. In 
the first case, you should pay attention to the 
motion of the cars, in the second case to their 
colour. Such stimulus-feature selection is an 
example of cognitive control’? — our brains’ 
ability to flexibly adapt how we process infor- 
mation from moment to moment, depending 
on current goals. Mante and colleagues wanted 
to know how the brain decides which aspect of 
a stimulus should guide a behaviour. 

The authors designed an elegant task to 
explore this flexibility (Fig. 1a). They trained 
monkeys to look at a screen showing randomly 
moving dots, each dot randomly coloured 
either red or green. In trials in the colour 
context, the monkeys had to decide whether 
there were more red dots or more green dots, 
regardless of the dots’ direction of motion. 
In the motion context, the animals decided 
whether most of the dots were moving left or 
right, regardless of the dots’ colour. In both 
contexts, the same set of stimuli, containing 
motion and colour signals, was used. 

The team then recorded from the monkeys’ 
prefrontal cortex (PFC), a brain region thought 
to be crucial for flexible, context-dependent 
behaviour and which also controls the motor 
activity that animals use to report their deci- 
sions. Consistent with previous work'*!>, the 
activity of individual PFC neurons depended 
in complex, time-varying ways on multiple 


Motion context: 
Is overall motion left or right? (ignore colour) 


Colour context: 
Is there more green or red? (ignore motion) 


aspects of the task, including the motion and 
colour signals in the stimulus, the context 
and the monkey’s decision. In what initially 
seemed a bewildering array of responses, these 
dependencies varied greatly across neurons. 

To look at the data from a population view- 
point, the authors put together all the indi- 
vidual recordings for a given stimulus, and 
defined the neural population's response as 
the dynamical trajectory followed in a high- 
dimensional space in which each dimension 
represents the activity of one of the recorded 
neurons. They then used linear regression to 
find the axes (corresponding to a direction) in 
this space that best represented responses to 
the motion and the colour in the stimulus, and 
the monkeys’ decision (Fig. 1b). They could 
therefore estimate separately how each signal's 
representation in the PFC developed over time, 
and how it depended on the context. 

Which features of the data explained the 
context dependence of the behaviour? Not 
changes in the decision axis, which pointed in 
the same direction in both contexts. Nor could 
the behaviour be explained by irrelevant inputs 
being blocked from reaching the PFC, or by 
the inputs being changed in some context- 
dependent way, because the stimulus axes 
and the stimulus signals (motion and colour) 
on those axes remained remarkably constant 
across the two contexts. Yet, somehow, the 
decision signal was driven by the motion signal 
in the motion context, and by the colour 
signal in the colour context. 

To investigate how the switch in response to 
the different contexts came about, Mante et al. 
used modelling. They took model neurons 
with nonlinear input-output functions, and 
densely connected them in a recurrent net- 
work. They provided the network with inputs 


b Neural activity space: 
trajectories and axes 


A Stimulus 


off 
a aes 
TO Stimulus 


on 


Spikes pers 
neuron N 


te 
eS 
©) 
+ 


“dikes 
Ne ron ® ee 


Decision 


representing the motion and colour signals 
and, separately, the current context signal. The 
activity of one of the model neurons was used 
to indicate the decision of the network. 

The team then ‘trained’ the network by find- 
ing the strengths of the neuronal connections 
required to produce the appropriate, context- 
dependent decision in each trial. Starting the 
training from different initial random connec- 
tion strengths led to different connection pat- 
terns that solved the problem. But cutting-edge 
analyses of how the model networks were solv- 
ing the problem revealed that all the solutions 
shared two dynamical principles. 

First, a line of closely spaced, stable, fixed 
points lay along the decision axis, with the two 
possible outcomes at opposite ends of the line. 
During each trial, stimulus-driven evidence for 
or against a decision accumulated along that 
line by gradually nudging the system from one 
fixed point to the next. Second, in dynamical 
systems, some inputs can cause a long-lasting 
perturbation in the system's trajectory, whereas 
other inputs can be short-lived, and rapidly 
decay away. In Mante and co-workers’ model 
networks, the critical feature that depended 
on context was which of the inputs was short- 
lived. In the colour context, for example, sig- 
nals from the irrelevant input (motion) were 
rapidly quenched and had no lasting impact 
on the neural trajectory, whereas colour signals 
persisted to drive the decision (Fig. 1c). 

In other words, the context dependence was 
not achieved by controlling or gating the flow 
of information from one neuronal population 
to another. Instead, it was achieved within a 
single, densely interconnected cell population 
by controlling the directions along which the 
population dynamics quenched the neural 
activity. This dynamical principle can only be 
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Figure 1 | The experiment and its results. a, Schematic view of the visual 
stimulus used by Mante et al.’ in their study and the context-dependent 
decision their subjects had to make on each trial. b, In the neural activity 
space, each dimension corresponds to the activity of one recorded neuron. 
Together they give rise to the dynamical trajectory followed by the neuronal 
population in response to a particular stimulus (green). The thick coloured 
arrows illustrate the motion, colour and decision axes. ¢, Dynamical 
principle for context-dependent feature selection. In the colour context, 
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one of the directions in neural space along which activity is quenched 
points in the same direction as the motion-input axis. Motion signals 
therefore decay rapidly in the colour context. In the motion context, the 
rapidly quenched direction points elsewhere, allowing the motion input to 
nudge the system from one fixed point to the next along the decision 

axis (red crosses), driving the decision. Neither the stimulus inputs nor 

the decision axes change across the two contexts; only the quenching 
direction changes. 
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understood — in fact, only exists — at the pop- 
ulation level. And because the neural trajecto- 
ries in the model closely resembled the neural 
trajectories in the data, Mante et al. propose 
this principle as the neural basis of flexible, 
context-dependent processing. 

This fascinating study raises some intrigu- 
ing questions. First, how do we go beyond 
correlations and causally test the authors’ pro- 
posed principle? With existing perturbation 
methods, we cannot align manipulations with 
the critical axes that define the population’s 
dynamical behaviour. Second, is the region of 
PFC that the authors recorded from even nec- 
essary for the context-dependent behaviour 
investigated here? The behavioural task used is 
new, so this question has never been addressed. 
Third, the conclusions on the behaviour of 
neural populations are based on data from 
many separate single-neuron recordings. Will 
the same principles hold when multi-neuron 
recordings are used to directly see the popula- 
tion neural trajectories in single trials? Finally, 
what is the specific mechanism that the net- 
work uses to change its quenching directions? 
These questions aside, Mante et al. have led 
the field with novel and deep ideas that should 
drive much discussion and thinking. = 
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Uncertain then, 
irrelevant now 


Uncertainty in estimates of the effects of aerosols on climate stems from poor 
knowledge of the past, pristine atmosphere — so getting a better understanding 
of these effects might not be as useful as was thought. SEE ARTICLE P.67 


BJORN STEVENS 


erosol particles in the atmosphere have 

long been feared to be the joker in the 

climate system's pack of cards. Even 
before the greenhouse effect became a house- 
hold word, scientists had begun to consider the 
possibility that increasing concentrations of 
particles in the atmosphere’s lowest layer (the 
troposphere) were acting to cool the planet, 
and thus masking what would otherwise be 
much larger temperature changes caused by 
rising concentrations of atmospheric carbon 
dioxide’. Reporting on page 67 of this issue, 
Carslaw et al.” use an innovative approach 
to demonstrate that most of the uncertainty 
in estimates of aerosol forcing — the aerosol 
perturbation caused by humans that induces 
a global surface-temperature change — that is 
associated with cloud-affecting aerosols can be 
attributed to uncertainty in the emission rate 
of aerosol precursors from natural sources. 


Their work suggests that, if there is an aerosol 
joker, it was probably played a century ago, 
and has become irrelevant to understanding 
present and future changes in global climate. 
The aerosol, like the Earth system itself, is 
fascinatingly complex. How it interacts with 
clouds and radiation depends not only on its 
composition and concentration, but also on 
how these properties are distributed across 
particles of different sizes. All of these factors 
depend on uncertain microphysical processes 
and poorly understood aerosol sources, which 
in turn are integrated by the vagaries of the 
wind. Consequently, estimates of the strength 
of aerosol forcing derived from models that 
attempt to represent these poorly understood 
factors are controversial’. Carslaw et al. had the 
brilliant insight to recognize that something 
could be learned from this uncertainty. 
Using a comprehensive model of global 
aerosols to train a simple statistical model, the 
researchers characterized the 28-dimensional 
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uncertainty landscape of the 
comprehensive model. By explor- 
ing this landscape, they find that 


poorly constrained natural sources aie 
determine the state of the pris- E 

tine atmosphere, and thus domi- = 
nate uncertainty in estimates of s 

radiative forcing associated with & 
aerosol-cloud interactions. Their E 

findings aid the interpretation of 2 
earlier studies in which seemingly 5 

large differences in aerosol forc- 

ing have been found to be associ- ' 

ear 


ated with relatively small changes 
in rather arbitrary parameters 
— such as the minimum number 
of cloud droplets permitted in a 
model of a cloud*. Carslaw and 
colleagues’ work suggests that 
these parameters essentially act as 
a proxy for the state of the pristine 
atmosphere. 

The disproportionate sensitivity 
of aerosol forcing to the background 
state of the pristine atmosphere 
suggests that the present, far-from- 
pristine atmosphere should be 
quite well buffered against further 
changes in sources of tropospheric 
aerosols. Consistent with this line of think- 
ing, the authors used their model to show that 
the aerosol forcing from an important class of 
aerosol-cloud interaction has not changed for 
more than 30 years, a period that has witnessed a 
massive increase in the footprint of human 
influence on Earth and its atmosphere. Aerosol 
forcing is thus so well buffered that it has prob- 
ably ceased to be relevant for present and future 
global climate changes. 

Because aerosol forcing tends to plateau 
as aerosol concentrations rise, climate sensi- 
tivity — the change in the globally averaged 
temperature that accompanies a doubling of 
atmospheric CO, concentration — is better 
constrained from observations of the more 
recent past (for which changes in CO, forc- 
ing dominate; Fig. 1) than by referencing a 
distant, pre-industrial era whose atmospheric 
aerosol burden, Carslaw et al. argue, might well 
be unknowable. An analysis’ of observations 
over this more recent past, which discounts 
past uncertainty in aerosol forcing and exploits 
our better understanding of other elements of 
Earth's energy budget, such as average ocean 
heating rates, suggests that a surprisingly large 
climate sensitivity is not being masked by a 
large effect of aerosols, as was initially feared’. 

Because changes in tropospheric aerosols 
are localized near their sources, the above 
arguments do not rule out the possibility 
that anthropogenic emissions cause large 
regional effects. But if one follows the logic 
of Carslaw and co-workers’ findings, such 
effects will also have been most evident in 
the more distant past — particularly for a sce- 
nario in which natural emissions of aerosol 
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clear signal from the past, it seems 
reasonable to assume that the 
climatic effect of future changes 
in aerosol emissions, even on a 
regional scale, will not be large. 
But the ultimate lesson to be 
drawn from Carslaw and col- 
leagues’ research, and from other 
studies® ” published this year, is 
that a lack of understanding of basic 
processes — suchas how patterns of 
cloudiness and large-scale atmos- 
pheric, or oceanic, circulation 
respond to warming — is limiting 


Aerosol forcing important? 


Figure 1 | Climate forcing by tropospheric aerosols and carbon dioxide. 
The dark blue line represents the magnitude of the cooling effect from a 
tropospheric aerosol-forcing scenario in which a pristine natural atmosphere 
is strongly susceptible to perturbation by anthropogenic aerosol sources 
produced before the twentieth century. The light blue line depicts a scenario 
in which stronger natural sources of aerosols render the atmosphere less 
susceptible to human perturbations. The red line indicates the magnitude 

of the warming effect from forcing caused by changes in CO, and other 
long-lived greenhouse-gas levels. Shaded bands represent the more complex 
pattern of global warming over the past century (green, strong warming; 
grey, modest warming). Carslaw et al.’ report that aerosol forcing has hardly 
changed since 1980, and so forcing has been dominated by CO, changes over 
the past 33 years. (Curves adapted from data in refs 2 and 5.) 


precursors are low, thus making the pristine 
atmosphere especially susceptible to perturba- 
tions in aerosols and their precursors (Fig. 1). 
In fact, there is surprisingly little evidence 
of changing weather patterns attributable to 
large alterations in regional air quality over 
North America and Europe in the past. But 
more effort should be devoted to identifying 
a smoking gun for regional effects of aerosols 
in historical records, for instance through 
a systematic study of temperature changes 
in the twentieth century. In the absence of a 
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CO, dominates 


progress in quantifying present and 
future changes in Earth’s climate. 
Only by refocusing research on 
these basic processes does climate 
science stand a chance of remaining 
relevant toa public that is grappling 
to understand the pace and pattern 
of climate change. = 
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Good and bad news on 
the adolescent brain 


In response to bad news about risk, young adolescents alter estimates of their 
own vulnerability to adverse events less accurately than older people. The 
finding has implications for managing risk-taking behaviour in young people. 


VALERIE F. REYNA 


inston Churchill once referred 
to Russia as “a riddle wrapped 
in a mystery inside an enigma’. 
Adolescence could be described in the same 
way. Adolescents are at the peak of physical 
robustness, yet they are also at an elevated 
risk of death, injury and disease. Much of this 
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increased mortality and morbidity has been 
blamed on risky decision-making processes 
performed by a still-maturing brain’. Massive 
public-health campaigns and educational 
efforts have been aimed at reducing unhealthy 
risk taking, and have generally focused on 
exposing adolescents to information about the 
adverse health consequences of risky behav- 
iours. Writing in Proceedings of the National 
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Academy of Sciences, Moutsiana et al.’ presenta 
study that raises questions about how effective 
all of this ‘bad news is in teaching adolescents 
to update their risk estimates. 

Previous research has shown that adults 
are more likely to incorporate good news 
than bad news into their beliefs. For exam- 
ple, news that a genetic mutation places them 
at lower risk of cancer than they thought is 
more likely to alter an adult’s belief about his 
or her cancer risk than is news of a mutation 
that puts them at higher risk. Moutsiana and 
colleagues have now examined this process of 
belief updating in response to good and bad 
news in individuals aged from 9 to 26 years, 
a period during which the brain is matur- 
ing. The participants were presented with 
40 adverse events and asked for their estimates 
of personal risk — how likely (as a percentage) 
the event was to happen to them in the future. 
Then, the actual likelihood was presented 
and, in a second session a few moments later, 
they re-estimated their personal risk of the 
same events (Fig. 1). 

The researchers found that updating beliefs 
about risk did not change significantly with 
age for good news (when participants were 
at a lower risk of an adverse event than they 
thought). But information about being at 
a higher risk — bad news — was correlated 
with age, such that beliefs were not suffi- 
ciently updated in younger participants. The 
gap between learning from good versus bad 
news narrowed with age. The authors con- 
trolled statistically for a host of alternative 
explanations for these age differences, includ- 
ing prior beliefs and past experience with the 
events. They also ensured that the youngest 
respondents understood the concept of per- 
centages and that none of the respondents was 
depressed, which could have produced differ- 
ences in responses to negative events. 

In the brain, dopaminergic neurons are 
activated in response to outcomes such as 
rewards; in particular, they signal the discrep- 
ancy between expected and actual rewards 
— referred to as prediction-error signals. 
Moutsiana et al. argue that the good-bad 
differences they document originate in an 
asymmetry in the development of dopamin- 
ergic responses to prediction-error signals. In 
fact, hyper-responses to rewards in the stria- 
tum region of the brain (a rewards area) peak 
during adolescence’, although some research 
suggests’ that adolescents are hyporesponsive 
to rewards. By contrast, activity in the brain's 
prefrontal cortex region (a cognitive-control 
area), specifically the inferior frontal gyrus, is 
associated with updating beliefs to negative 
errors in this estimation task. 

Moutsiana et al. also recorded an 
important developmental difference in 
individuals’ memories of the presented risk infor- 
mation, assessed by asking the participants the 
presented actual probability at the end of the 
sessions. They found that misremembering 
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Figure 1 | Adverse events. Moutsiana et al.” studied how adolescents’ estimates of their own risk 
of experiencing an adverse event, such as a broken bone, changed in response to information about 
their actual risk. 


was related to age for both good and bad news, 
with younger participants remembering less 
accurately. Furthermore, memory for risks 
correlated with updating of risk estimates 
across the age range. Nevertheless, although 
such memory differences influence risk per- 
ception and behaviour’, they do not explain the 
good-bad differences observed in this study. 
Much of the research on adolescent risk 
taking, including this study, is steeped in theo- 
ries of neo-Freudian dualism, in which the pre- 
frontal cortex (Sigmund Freud’s ‘super-ego’) 
suppresses the urges of an instinctual limbic 
system (the ‘id’). But this latest work high- 
lights the often-neglected issue of learning. 
In an earlier generation of research, learning 
was the centrepiece of psychology — gener- 
ally speaking, to be a psychologist was to be 
a learning theorist. Learning also offered the 
hope of change, by definition, because learning 
involves an alteration in knowledge or beliefs. 
Current theories of the adolescent brain 
rightly emphasize socio-emotional develop- 
ment’, but one lesson from Moutsiana and 
colleagues’ study is that cognitive and motiva- 
tional changes from childhood to adulthood 
are intertwined. Learning, which is a cognitive 
faculty, occurs for positive and negative infor- 
mation, but it occurs differently depending on 
motivational salience (people are more moti- 
vated to believe good news than bad news). 
This work opens the door to many ques- 
tions about risk perceptions and their effect 
on behaviour. Like many others in the field 
of risky decision-making, this study extracts 
numerical risk estimates from participants 
and provides them with other such estimates. 
How do we know that a person understands 
his or her personal risk? We ask them to parrot 
a probability. Although there are advantages to 
this method (for example, it permits intrigu- 
ing analyses of calibration and updating), it 


ignores a basic psychological truth about risk 
estimates: that the estimates used by people 
in the real world to guide their behaviour are 
mentally represented not as precise quantities 
but as qualitative gists, such as ‘a really high 
risk’. Researchers in medical decision-making 
and public health have come to realize that 
simply remembering and repeating a number 
verbatim does not capture how most people 
represent risk in their minds and brains*”°. 
Moutsiana and colleagues’ exciting findings 
highlight the related concept of the emotional 
gist of risk information, which is likely to col- 
our intuitions about risk differently across the 
lifespan''. The riddle of adolescent risk tak- 
ing may not be solved any time soon, but we 
are seeing a great leap in scientific knowledge 
about this topic, with broad implications for 
enhancing public health and well-being. m 
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Cortical connectivity and sensory coding 


Kenneth D. Harris’? & Thomas D. Mrsic-Flogel?* 


The sensory cortex contains a wide array of neuronal types, which are connected together into complex but partially 
stereotyped circuits. Sensory stimuli trigger cascades of electrical activity through these circuits, causing specific 
features of sensory scenes to be encoded in the firing patterns of cortical populations. Recent research is beginning to 
reveal how the connectivity of individual neurons relates to the sensory features they encode, how differences in the 
connectivity patterns of different cortical cell classes enable them to encode information using different strategies, and 
how feedback connections from higher-order cortex allow sensory information to be integrated with behavioural 


context. 


ur senses are constantly bombarded with a vast amount of 

information. To guide behaviour, the brain must extract and 

amplify a relatively small number of features from this massive 
input; for example, features corresponding to the nature and location of 
objects in the world. In mammals, the cerebral cortex has a central role 
in this process. The cortex contains multiple hierarchically arranged 
areas devoted to each modality. Neuronal populations in these areas 
extract features of sensory scenes and integrate them with non-sensory 
cognitive and behavioural variables, in a progressively abstract manner’. 
Each sensory stimulus causes a complex pattern of activity in the neur- 
onal populations of multiple cortical areas. The relationship between 
sensory stimuli, and the firing patterns they evoke, defines the ‘neural 
code’ of the corresponding populations. 

Each cortical area contains a richly interconnected array of diverse 
cell types, whose patterns of connectivity underlie the cortex’s ability to 
extract sensory features. The circuits of different cortical regions and 
species share striking commonalities in their constituent cell types, their 
intrinsic properties, and the incidence and properties of synaptic con- 
nections between them*’. Connections between cortex and other brain 
structures, and connections between cortical regions, also show a largely 
preserved relationship to cortical cell classes. These similarities suggest 
that there may be a set of general principles linking the common char- 
acteristics of cortical circuitry to the nature of cortical processing in 
multiple areas. 

If such common principles exist, they are unlikely to describe which 
sensory features are encoded in particular cortical areas, but rather how 
these features are encoded. Cortical neurons detect complex and non- 
linear features from sensory scenes, whose precise character will obviously 
differ between sensory modalities. Nevertheless, there are notable com- 
monalities in the strategies with which populations of different cortical 
areas encode diverse types of information, and in how these strategies 
differ between neuronal subtypes. An understanding of these strategies in 
turn enables us to consider why cortex might be organized in this way; for 
example, why the coding strategies used by each cell class are appropriate 
to their particular role in the circuit and their particular synaptic targets. 

In recent years there has been a great acceleration in progress towards 
answering these questions, enabled by a number of newly developed 
experimental techniques. Advances in mouse genetics have provided 
tools to classify, identify and manipulate different classes of cortical 
neuron**. Optical and electrical techniques now allow scientists to 
simultaneously record the activity of large numbers of neurons in the 
living brain’*. Advances in circuit mapping techniques allow neuronal 


connections to be reliably identified”"''. Optogenetic techniques allow 
scientists to control neuronal activity with high temporal resolution 
using light-gated proteins'’. Most importantly, recent studies have 
begun to apply these techniques in combination, directly relating cor- 
tical circuitry to sensory coding. Here we review some of the insights 
recently made with these techniques, with a specific focus on the visual, 
auditory and somatosensory cortices of rodents. 


The organization of cortical connectivity 


Cortical neurons divide into two major classes. Principal cells are neurons 
that use the excitatory neurotransmitter glutamate. Usually pyramidal in 
shape, these cells respond selectively to specific features of sensory stim- 
uli, and contact local and distant targets through extensive axonal pro- 
jections. Principal cells comprise approximately 80% of cortical neurons 
in rodents, and fall into multiple classes distributed across and within 
cortical layers (Box 1). The remaining approximately 20% are interneur- 
ons that release the inhibitory neurotransmitter GABA and make mostly 
local connections (Box 2). Both principal cells and interneurons comprise 
multiple subclasses, whose classification is an active area of research. 

The probability of connection between two neurons, and the physio- 
logical properties of the corresponding synapses, depends on the pre- 
and postsynaptic cell types (Boxes 1 and 2). Connectivity within a single 
cell class is also highly structured. Analysis of connections between 
principal cells reveals an overrepresentation of bidirectional connec- 
tions between pairs, and of mutual interconnection motifs in larger sets 
of neurons'*"* (although this was not seen in ref. 15). Such non-uniform 
connectivity occurs both within and between layers, with principal cells 
more likely to exhibit reciprocal connections if they receive common 
intra- or inter-laminar input'®’’. These data suggest that each cortical 
area contains multiple interdigitated subnetworks of highly intercon- 
nected principal cells (Fig. 1). Note that although motifs such as bidirec- 
tional connections can occur several times more frequently than expected 
by chance, they do not occur with a probability of 1. This suggests that the 
subnetworks are not discrete: not every pair of neurons within a subnet- 
work is connected, and each principal cell can belong to multiple subnet- 
works. Unlike principal cell-principal cell connectivity, connectivity between 
principal cells and at least some interneuron classes seems to be nonspe- 
cific. Although interneurons can show precise targeting of connections to 
specific cell classes and subcellular locations, their connection probability 
to neighbouring principal cells is close to 100%, at least for parvalbumin- 
and somatostatin-expressing interneurons'*””. 
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Sensory information arrives from primary thalamus into all 

cortical layers, but most densely into L4 and the L5-L6 border?°*°°. 
Contextual inputs from higher-order cortex and thalamus most 
densely target L1, L5 and L6, but avoid L4 (refs 2, 9). These inputs 
are integrated by a diverse array of cortical cell types. The figure 
shows a current understanding of the connectivity between the 
major principal cell classes of sensory cortex. Line thickness 
represents the strength of a pathway; question marks indicate 
connections that appear likely but have not yet been directly 
demonstrated. 

L4 principal cells comprise two morphological classes, pyramidal 
and spiny stellate cells, whose intrinsic properties and coding 
strategies appear largely similar?®. The architecture of L4 varies 
between species and modalities, which may reflect its developmental 
shaping by thalamic innervation and activity?”°° in which the L4- 
specific gene RorfB has been implicated°°. L4 principal cells project to 
all layers, but most strongly L2/3. However they receive little 
intracolumnar input in return, as evidenced both by paired recording 
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Canonical connectivity of cortical principal cells 


studies", and the fact that optogenetic stimulation of L2/3 has little 
effect on L4 (ref. 46). 

L2/3 principal cells are often considered homogeneous, but 
comprise as-yet incompletely classified subtypes, with different 
patterns of axonal targets, gene expression, and in vivo firing 
patterns!©°. Their major outputs are to higher order and 
contralateral cortices, as well as locally to L5. 

L5 principal cells comprise two broad subclasses, whose fate is 
developmentally determined by suppressive interactions between the 
genes Satb2, Fezf2 and Ctip2 (ref. 102). ‘Intratelencephalic neurons’ 
(ITNs) are often found in upper L5 (ref. 103), and fire adapting spike 
trains at moderate rates. They project locally upward to L2/3 and 
distally to the ipsi- and contralateral cortex and striatum, but not to 
targets outside the telencephalon. ‘Subcerebral projection neurons’ 
(SPNs) are larger cells with prominent dendritic tufts in Ll and a 
periodic spatial organization. SPNs show little spike train 
adaptation, and under some—but not all—conditions fire in bursts?°°. 
SPNs receive prolific inputs from multiple cortical cell classes, whose 
development depends on Shh signalling!°°. However, they give little 
local output in return®?, therefore exhibiting the inverse connectivity 
pattern to L4 principal cells. SPNs of even primary sensory cortices 
project to subcerebral motor centres, and can directly drive 
movements?°”. Their axons send collaterals to ipsilateral striatum and 
higher-order thalamus with large, strong ‘driver’ synapses?©2°°, 
Together, these characteristics define a class of neuron that 
accumulates information from an entire cortical column, and 
broadcasts sustained, powerful outputs to distant targets. 

L6 principal cells comprise at least two subclasses with distinct 
molecular identities and projection targets'!°, whose fate is partially 
determined by the gene Tbr1 (ref. 102). Corticocortical cells (CCs) 
have small dendritic trees, long-range horizontal axons, and 
occasionally bizarre morphologies such as inverted somata?!?. 
Corticothalamic cells (CTs) send projections to thalamus which, unlike 
hose of L5 SPNs, are weak, target the reticular and primary sensory 
halamic nuclei?©8, and travel through slowly conducting fibres (as low 
as 0.5 ms _? in rabbit)®°. Corticothalamic cells also project to cortical 
ayer 4, where they strongly target interneurons'?!, as well as 
hyperpolarizing principal cells via group Il mGluRs!?*. Consistent with 
his connectivity, optogenetic stimulation of L6 in vivo suppresses 
cortical activity, suggesting a role of this layer in gain control or 
ranslaminar inhibition??°. 


Functional specificity of excitatory subnetworks 

Synaptically connected principal cells tend to process similar types of 
information. In mouse visual cortex, the connection probability between 
nearby superficial principal cells is higher for neurons that respond to 
similar visual features*®. Connection probability is elevated for neurons 
sharing the same orientation preference, but is higher still for neurons 
responding similarly to natural movies. This suggests that connection 
probability mirrors similarities in the complex and as-yet poorly under- 
stood feature combinations extracted by visual cortical neurons. The 
relationship between connectivity and sensory tuning is preferential 
but not exclusive: not all neurons that respond to similar features are 
connected, and not all connected neurons respond to identical features. 
This indicates that the neuronal populations that respond to different 
sensory features are not discrete and disconnected from each other, but 
interconnected and overlapping. Consistent with functionally specific 
recurrent connectivity, whole-cell recordings in vivo indicate that the 
excitation principal cells receive from their cortical neighbours is tuned 
similarly to their direct thalamic input, at least with regard to simple 
features such as visual orientation and sound frequency”!*. Functionally 
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specific principal cell connectivity may be a general feature of at least 
visual cortex, as long-range horizontal excitatory projections in carnivor- 
ans are also biased towards neurons with similar stimulus preferences”. 
In contrast to principal cells, local excitatory connections onto interneur- 
ons (at least, onto parvalbumin-expressing interneurons) seem unrelated 
to feature preference’””®. 

These data suggest that the interconnected subnetworks postulated 
in the previous section consist of neurons that are largely, but not 
exclusively, tuned for similar sensory features. This organization may 
provide several benefits. First, recurrent excitation amplifies cortical 
responses’ **”’, By reinforcing thalamic inputs, recurrent excitation 
can increase the spiking probability of individual neurons, and increase 
the number of neurons responding to a given stimulus. This will reduce 
susceptibility to noise and boost the efficiency with which they can 
drive downstream targets. Second, recurrent excitation prolongs sensory 
responses’ **, This will again boost the efficacy of sensory responses, and 
may allow responses to brief stimuli to be maintained for long enough to 
interact with subsequent sensory inputs as well as later-arriving feedback 
from higher-order regions. Third, recurrent excitation may allow for 
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BOX 2 
Classes of inhibitory interneuron 


The cortex contains three major families of interneuron, each of which 
divides into multiple subclasses. The figure shows a current 
understanding of the synaptic targets of five classes of cortical 
interneuron, with the green neuron representing a principal cell. 
Question marks indicate connections that appear likely but have not 
yet been directly demonstrated. 

Parvalbumin-expressing interneurons (PVs) are capable of firing 
rapidly and with high temporal precision. They consist of two main 
subgroups: basket cells (BCs) that target the soma and proximal 
dendrites of principal cells, and chandelier cells (ChCs) that target the 
axon initial segment. PV cells receive strong excitatory inputs from 
thalamus and cortex'4, as well as inhibition from other PVs?!°. A key 
role of these cells is to stabilize the activity of cortical networks: their 
absence leads to epileptiform activity, whereas more moderate 
chronic dysfunction of these cells has been implicated in diseases 
such as schizophrenia?!®. 

Somatostatin-expressing interneurons (SOMs) consist largely, but 
not exclusively, of Martinotti cells that target the tuft dendrites of 
principal cells, as well as inhibiting other interneurons!!°"!’, 
Consistent with their targeting of dendritic tufts, these cells have 
been implicated in behaviour-dependent control of dendritic 
integration®®1!®, as well as in more general lateral inhibition’. 
Connections from principal cells to SOMs show facilitating 
synapses!!9!2°, whose establishment depends on postsynaptic 
expression of the cell-surface protein Elfn1 (ref. 121). In contrast to 
PVs, SOMs receive the majority of their input from local principal cells 
but little inhibition or thalamic drive*71191"7, 

5HT3A-receptor-expressing interneurons are the most numerous 
interneuron of the superficial layers!#°. Although the classification of 
these neurons is still incomplete, they contain two prominent 
subgroups: neurogliaform cells (NGs), which are thought to release 
GABA by volume transmission?*?; and cells that express vasoactive 
intestinal peptide (VIP) and preferentially target SOMs®*)°, Putative 
5HT3A-receptor-expressing cells have been implicated in learning!2* 
and control of cortical circuits by higher-order cortex and 
thalamus®?2°, 


REVIEW 


Figure 1 | Proposed fine structure of neocortical connections. Connectivity 
analyses suggest that cortical circuits contain multiple interdigitated 
subnetworks of highly interconnected principal cells. Red, blue and green 
triangles represent principal cells in three subnetworks; arrows of the 
corresponding colour represent synaptic connections within each subnetwork. 
Notall cells within a subnetwork are connected, and any cell can belong to more 
than one subnetwork (striped triangles). Unlike principal cells, parvalbumin- 
expressing interneurons (grey circles) connect nonspecifically to principal cells 
in their local area. 


more complex feature integration in sensory scenes. Although connec- 
tion probabilities are elevated between neurons of the same feature pref- 
erence, substantial connectivity also exists among neurons with different 
feature preference. If these connections link neurons driven by features 
whose combination is of particular behavioural relevance (for example, 
features forming extended visual contours or spectrally complex sounds), 
recurrent connectivity may allow the subnetwork to specifically amplify 
responses to these combinations. 


Establishment of cortical connectivity patterns 
Cortical circuits are wired by a combination of molecular cues and 
activity-dependent synaptic plasticity. Recent research is beginning to 
reveal the molecular signals governing the incidence and synaptic prop- 
erties of connections between specific cell classes (Box 1). However, a 
key question is what determines the non-uniform connectivity patterns 
within a principal cell class. The first possibility is that interconnected 
subnetworks reflect still-undiscovered neuronal subtypes. For example, 
connection probabilities within the apparently uniform class of callosally 
projecting layer 5 (L5) principal cells could be predicted by similarity 
of physiological characteristics'’, suggesting preferential connectivity 
between cryptic subclasses. Recent work suggests that clonal sister neu- 
rons (that is, neurons arising from divisions of acommon progenitor cell) 
are more likely to be synaptically connected and share orientation pref- 
erence than unrelated cells**. A single progenitor can give rise to pref- 
erentially connected principal cells of multiple subclasses, suggesting that 
early developmental processes beyond subclass specification contribute 
to wiring intracortical circuits. Transient electrical synapses between 
clonal sisters before eye opening have been implicated in this process”. 
These early developmental processes, however, seem to simply provide 
a starting point for activity-dependent refinement of cortical circuitry 
after the onset of sensory experience. Indeed, studies in superficial visual 
cortex suggest the relationship between principal cell connectivity and 
visual feature preference is weak at eye-opening, only becoming strong 
after visual experience, which suggests that sensory input leads to the 
formation of functionally specific subnetworks*’. An intuitive argument 
suggests that Hebbian plasticity—strengthening of connections between 
principal cells that fire together—should lead to interconnected subnet- 
works of cells tuned to similar or commonly co-occurring features; 
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indeed, this is the essence of Hebb’s original ‘cell assembly’ hypothesis*’. 
Understanding how more complex cortical plasticity processes sculpt 
cortical networks to detect salient sensory features, is an important topic 
of research in computational and experimental neuroscience*”**”»*. 


Coding strategies differ between cell classes 


Different cortical cell classes encode information differently. The major 
difference is not in what information is encoded (for example, pref- 
erence for a particular sensory feature), but in the manner in which it 
is encoded. The best-understood difference in coding strategy between 
cell classes is in their ‘sparseness’. For neuronal classes employing a 
‘sparse code’, information is encoded at any instant by the spiking of a 
small subset of cells within the population. At the other end of the 
spectrum (a ‘dense code’), most neurons are active at any moment, with 
information encoded by variations in their firing rates. Sparse coding is 
typically associated with low mean firing rates and high selectivity of 
individual neurons for sensory features, although in principle this asso- 
ciation is not absolute*®. Sparse codes are considered efficient, because 
they are able to transfer information with fewer spikes*’. In addition, 
sparse codes may also increase information storage capacity at the level 
of neuronal populations, because of a smaller overlap in representations 
for different stimuli**. However, sparse coding imposes a different type 
of cost, as the large number of neurons required to support sparse codes 
must occupy a significant spatial volume. This cost would be particularly 
acute for physically larger neurons, such as those that must support 
long-range axonal projections. 


Principal cells 
Recordings in multiple modalities indicate that L2/3 principal cells 
encode information sparsely (Fig. 2a). L2/3 principal cells have low 
spontaneous and evoked firing rates (usually <1 spike per s in rodents), 
and respond selectively only to a narrow range of stimulus features”. 
This sparseness is likely to result from a combination of strong, unse- 
lective inhibitory connections, and functionally specific excitation. L2/3 
contains large numbers of densely connected interneurons*’, which 
provide powerful synaptic inhibition, particularly in awake animals“. 
Both in vitro and in vivo, optogenetic stimulation of L2/3 principal cells 
causes predominant inhibition rather than excitation in other L2/3 
principal cells**“* (Fig. 2b). Consistent with strong lateral inhibition, 
recent work suggests that at least in auditory cortex, superficial layer 
activity in vivo is sparse and patchy at a spatial scale of <200 tm*™”. 
In contrast, L5 principal cells—in particular subcerebral projection 
neurons (SPNs; see Box 1)—encode information densely, firing at rela- 
tively high rates (5-15 spikes per s) and responding to a broad range of 
stimuli in a graded manner®**”° (Fig. 2a). This firing pattern again 
mirrors their connectivity, as L5 SPNs integrate thalamic and intracortical 
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Figure 2 | Coding strategies of different cortical 
layers. a, Cortical activity is sparse in superficial 
layers but dense in deep layers. Left, inferred 
location of a multisite electrode in rat auditory 
cortex, with recording sites indicated by yellow 
squares. Right, raster plot of activity, with tick 
marks indicating spike times of simultaneously 
recorded neurons at the corresponding cortical 
depths. b, Cell-type-specific coding may result 
from stronger inhibition in superficial and stronger 
excitation in deep layers. Green-filled triangles 
represent active neurons; empty triangles represent 
silent neurons. Arrows represent net excitaton, 
thick black lines represent net inhibition. Part a is 
adapted from ref. 40. 


excitatory input from diverse principal cell classes and subnetworks, while 
receiving weaker inhibition’*'~*>*. Again consistent with this connectivity, 
in vivo estimates of synaptic currents suggest putative SPNs receive broadly 
tuned excitatory drive, compared both to the excitation received by other 
cell classes and to the inhibition they themselves receive*"**. Furthermore, 
optogenetic stimulation of L5 in vivo—unlike stimulation of L2/3—causes 
self-sustaining activity, consistent with strong recurrent excitation in L5 
circuits**°° (Fig. 2b). 

By using different coding strategies in different populations, the cortex 
may have found a way to balance the benefits and costs of sparse coding. 
We propose that sparse coding in a large number of physically smaller, 
intracortically projecting neurons (L2/3 principal cells) may allow effi- 
cient information storage in local synapses; the use of dense coding in a 
smaller number of large, long-range projection cells (L5 SPNs) may 
allow efficient broadcast of the results of cortical computations to distant 
structures, without requiring excessive physical volume. Thus, the differ- 
ential coding schemes of L2/3 and L5 principal cells appear tailored for 
the targets their axons innervate. 


Interneurons 

The way information is encoded in the cortex’s multiple interneuron 
classes is a topic of active research and some controversy*®*”®. Inter- 
neuron activity can be strongly and diversely modulated by non-sensory 
factors such as ongoing behaviour (see below), but interneurons are also 
driven by sensory input. Unlike principal cells, interneurons (at least, 
parvalbumin-expressing interneurons) receive functionally unspecific 
inputs from nearly every principal cell in the local area'®’®. Consistent 
with this broad connectivity, interneuron sensory tuning seems to appro- 
ximate the average of the principal cells in their local neighbourhood”. As 
a consequence, even superficial interneurons seem to use a dense coding 
strategy more similar to deep principal cells than to their superficial 
principal cell neighbours”. However, this coding strategy is intimately 
related to the way activity is organized across the cortical surface, as 
discussed below. 


Connectivity, coding and cortical maps 


One of the most prominent differences in cortical organization between 
modalities and species is in the spatial mapping of information on the 
cortical surface. Often, the cortical surface maps the organization of prim- 
ary receptor surfaces in the sense organs (such as the retinotopic, cochleo- 
topic and somatotopic mapping of visual, auditory and somatosensory 
cortices). However, when sensory features are mapped in a manner not 
inherited from the receptor surface, differences between species can be 
seen. For example, the visual cortices of carnivorans and primates, but 
not rodents, show a functionally organized and periodic arrangement of 
orientation preference®'” (Fig. 3a). This may be unavoidable in rodents 
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Figure 3 | Proposed relationship between feature mapping and coding in 
rodent visual cortex, cat visual cortex and rodent auditory cortex. a, Non- 
topographic arrangement of orientation preference in rodent V1 contrasts with 
spatially contiguous arrangement of orientation preference in cat V1 and of 
frequency preference in rodent A1. b, In all cases, principal cells receive inputs 
preferentially from neurons with similar feature preference. c, Incomplete 
functional specificity of connections for non-mapped features such as 
orientation in rodent V1 leads to broader subthreshold tuning. This is 
converted to sharp tuning of firing output by nonlinear neuronal amplification 
and synaptic inhibition. d, Inhibitory interneurons receive dense and 
unselective input from all neighbouring neurons. e, The inhibitory and 
excitatory synaptic inputs received by a principal cell are both strongest for the 
same preferred stimulus, but inhibition is more broadly tuned than excitation, 
leading to sharpened tuning of spiking output. 


given the small size of their visual cortex: if mice had orientation columns 
similar in size to carnivorans, they would be able to see only one orienta- 
tion in any part of the visual field. 

Differences in cortical maps lead to differences in the tuning of syn- 
aptic input for sensory features. Although connections in rodent primary 
visual cortex (V1) are biased to cells of similar orientation preference, this 
bias is not complete” (Fig. 3b). Accordingly, whole-cell recordings in 
individual neurons show substantial subthreshold depolarization also 
in response to non-preferred orientations***** (Fig. 3c). When features 
are mapped in a more orderly manner across the cortical surface, such as 
in iso-orientation domains of carnivoran visual cortex and iso-frequency 
bands in rodent auditory cortex, subthreshold tuning is sharper**®°, 
consistent with the dominance of local inputs of similar feature preference 
(Fig. 3b, c). In carnivoran V1, subthreshold tuning is also broader at 
‘pinwheel centres’ (where neighbouring neurons exhibit a diverse range 
of orientation preferences) than in iso-orientation domains™, again 
reflecting the importance of functional map organization in determining 
subthreshold tuning. 


REVIEW 


Despite broad subthreshold tuning, the spiking output of principal 
cells can still be highly selective for non-mapped features**. This seems 
to reflect a combination of nonlinear neuronal integration, which can 
convert subtle differences in membrane potential to large differences in 
spiking®’, and synaptic inhibition. Consistent with their nonspecific local 
connectivity, parvalbumin-expressing interneurons are selective for 
smoothly mapped features such as orientation in cat visual cortex®, tone 
frequency in rodent auditory cortex” and spatial position in rodent visual 
cortex’, but are unselective or broadly tuned for unmapped features such 
as orientation in mouse V1 (refs 26, 57-59) (Fig. 3d, e). In vivo patch- 
clamp experiments and models suggest that broadly tuned synaptic 
inhibition has an important role in producing sharp output selectivity 
for orientation in mouse V1 and in pinwheel centres of carnivoran V1 
(refs 64, 69). Thus, selective tuning of excitatory neurons can occur 
irrespective of whether a stimulus feature is mapped smoothly on the 
cortical surface, due to functional biases in connectivity, nonlinear neur- 
onal integration, and appropriately tuned inhibition. 

These data suggest two hypotheses regarding the relationship between 
connectivity and coding across areas and species. First, certain features of 
cortical connectivity appear to be preserved across multiple systems: 
principal cells receive functionally selective excitatory inputs both locally 
and from distant regions, while the connectivity of many interneurons is 
local but nonspecific. Second, although the selectivity of subthreshold 
responses of principal cells and spiking responses of local interneurons 
will depend on how a particular stimulus parameter is mapped across 
the cortical surface, the firing of principal cells—the cortex’s final output— 
is highly selective even without feature mapping. Indeed, unmapped 
orientation is found in even highly visual rodents such as squirrels”, 
suggesting mapping is not essential for accurate sensory function. 


Redundant coding in cortical populations 


Current experimental techniques allow recordings of up to a few hundred 
neurons simultaneously. One of the most notable findings from these 
experiments is that cortical population codes are redundant. The firing 
pattern of a neuronal population can be summarized (to a first approxi- 
mation) by the subset of neurons that are active at any moment. In 
principle, the number of potential subsets of even a moderately-sized 
neural population is vast. In practice however, only a much smaller number 
of firing patterns can actually occur. The existence of preserved con- 
straints on population activity indicates that cortical coding is redundant, 
as the firing of any one neuron can be partially inferred from the activity 
of its peers*’*”’”°. Redundant coding is found in many other domains: 
written language is redundant, for example, because only a small fraction 
of potential letter combinations spell actual words. This redundancy 
allows words to still be understood if letters are missing or misprinted. 

The nature of cortical redundancy can be studied by considering 
patterns of neuronal correlation. The firing of neighbouring neurons 
is often correlated, and the structure of correlations in a population is 
broadly similar when considering the mean responses to different stimuli 
(‘signal correlations’), the variability of responses to multiple presenta- 
tions of a single stimulus (‘noise correlations’), as well as activity occur- 
ring without sensory stimulation (‘spontaneous correlations’)**”°”’. This 
preservation of correlation structure suggests that cortical redundancy 
arises from constraints imposed by cortical wiring. Although individual 
pairwise correlations are typically weak (approximately <0.1), even weak 
pairwise correlations indicate strong constraints on the subsets of neu- 
rons that may be co-active at the population level”. 

A natural hypothesis is that the restricted subsets of neurons that can 
in practice fire together correspond to the interconnected subnetworks 
suggested by connectivity analyses. Although at present this remains 
conjecture, several pieces of evidence suggest it may be the case. Cor- 
relations are stronger between synaptically connected principal cells”, at 
least in visual L2/3. These correlations do not reflect the direct driving of 
one cell by the other, as synaptic connections between principal cells are 
typically weak (approximately <1 mV)'*”; rather, they indicate that 
synaptically connected neurons frequently fire together as part of larger 


7 NOVEMBER 2013 | VOL 503 | NATURE | 55 


©2013 Macmillan Publishers Limited. All rights reserved 


REVIEW 


neuronal subsets that provide correlated input to both. Both noise and 
signal correlations are higher between connected neurons, suggesting a 
direct relationship between connectivity and the constraints on co- 
active neuronal subsets”’. Finally, the structure of correlations matches 
that of connectivity patterns across layers and cell classes. For L2/3 
principal cells, correlations are typically weak and fall off steeply with 
anatomical distance***°”’, consistent with their local and functionally- 
specific connectivity profiles. Correlations among pairs of fast spiking 
interneurons and pairs of L5 principal cells are stronger and persist over 
larger distances**”°, consistent with denser and more widespread con- 
nectivity***°, and therefore larger fractions of shared input. 

Another important question is why the cortex might employ redund- 
ant coding. Redundant codes are inefficient, in that they require more 
neurons, more spikes, and thus more energy. Patterns of raw sensory 
input are redundant: much of the variance in natural image scenes, for 
example, reflects simple changes of luminance. Eliminating this redund- 
ancy seems to be a major function of the early sensory systems’*”, beg- 
ging the question of why redundancy would be reintroduced in cortex. 
We suggest two reasons. First, redundant codes are robust. Neuronal 
activity is noisy; furthermore, long-range neuronal connections are diver- 
gent, implying that distal targets receive inputs from only a fraction of the 
neurons in any given cortical area. Redundant coding allows the noisy 
output of a small fraction of cells to convey a message close to that 
encoded by the whole population. Second, cortical redundancy may 
reflect selective amplification of important sensory features. One might 
expect the number of neurons used to encode a given feature to match the 
relevance of that feature for later neuronal processing. The retina might 
remove redundancy imposed by luminance changes because luminance 
is not usually behaviourally important; cortex may create new redund- 
ancy by allocating large subnetworks to subtle but important sensory 
features, which thereby gain greater potential control of the animal’s 
behaviour. 


Top-down connections and contextual modulation 


In the natural environment, animals must combine sensory and non- 
sensory information in diverse and novel ways according to changing 
behavioural demands. Neurons of even primary sensory cortex do not 
just encode sensory input, but integrate it with contextual information 
such as reward”, expectation’”®”’, attention”® and motor action” *’. 

Most inputs to primary sensory cortical areas come not from primary 
thalamus, but from higher-order structures. These ‘top-down’ inputs 
can carry detailed information about diverse behavioural and cognitive 
variables*’. Top-down inputs project heavily to L1, where they synapse 
on interneurons, and on the apical tuft dendrites of principal cells. 
Although distal inputs may not alone drive principal cells to fire, they 
enable dendritic calcium spikes that increase the gain of a cell’s res- 
ponses to proximal synapses carrying sensory information***’. By tar- 
geting subnetworks encoding sensory features of particular contextual 
relevance, top-down connections may thus amplify their sensory res- 
ponses, and enhance contextually-appropriate behavioural outcomes. 
Consistent with top-down selection of subneworks, experiments in 
primate area V4 show that attention to specific locations or features 
amplifies population responses to corresponding stimuli®***. Furthermore, 
both the behavioural and neural correlates of spatial attention can be 
mimicked by electrical stimulation of a topographically-aligned area of 
the frontal eye fields***’, consistent with a role for top-down inputs in this 
amplification. 

Recent evidence from rodent cortex suggests that active behaviour can 
change the operating mode of sensory cortex in diverse ways, by modu- 
lating the activity of different interneuron classes. For example, active 
whisking hyperpolarizes somatostatin -expressing cells in superficial layers 
of barrel cortex"; this results from their inhibition by VIP-expressing 
interneurons, which are in turn excited by top-down afferents from motor 
cortex®’. In visual cortex, by contrast, locomotion increases the activity of 
somatostatin-expressing cells”’. As somatostatin-expressing cells inhibit 
apical tuft dendrites, these changes are in turn likely to differentially 
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impact the integration of top-down glutamergic input by principal cells. 
Understanding the way different active behaviours affect different neur- 
onal classes in different cortical areas is an important topic of current 
research. 


Outlook 


Recent progress has suggested how the coding strategies employed by 
several cortical cell classes may arise from their specific connectivity 
patterns. We end this review by outlining some of the major questions 
to be addressed in the near future, which will aided by the development 
emerging techniques. 

First, despite extensive research, we still have only a rudimentary 
understanding of the diverse classes of cortical excitatory and inhibitory 
neurons. The answer is likely to be complex, with developmental and 
epigenetic factors defining a hierarchy of ever more subtly differing 
subclasses. An unbiased classification of cortical cells may be provided 
not only by rapid progress in developmental neuroscience, but also by 
new technologies such as single-cell transcriptome sequencing. 

Second, the generic input-output connectivity profile of each cell class 
must be established. In addition to current methods such as multiple 
patch-clamp recordings in vitro, emerging techniques such as viral ret- 
rograde tracing”', large-volume electron microscopic reconstruction’*"’, 
mGrasp”, and CLARITY” will help this endeavour. In vivo recordings of 
identified cells will allow each class’s connectivity to be related to the 
coding strategy that it uses. 

Third, although it is now established that activity correlations are 
higher between synaptically connected neurons”, the relationship between 
larger-scale population codes and circuit connectivity is still unclear. One 
question is whether co-active principal cell subsets” really correspond to 
the interconnected subnetworks suggested by connectivity analyses'*"*. 
Answering this question will require large-scale connectomic analysis of 
populations whose activity has been previously recorded in vivo"”. 

A fourth question is how the top-down connectivity ofa cell class, and 
of an individual neuron, relates to its encoding of nonsensory variables. 
To answer this question will require large-scale recordings in behaving 
animals, together with anatomical and functional characterization of top- 
down inputs, for example, by post-hoc tracing, or imaging of axons” and 
individual synaptic inputs*®. 

Finally, it is important to consider how cortical connections and codes 
are shaped by experience and learning. The recent availability of gen- 
etically encoded Ca”* indicators for long-term imaging now makes it 
possible to relate changes in an individual neuron’s sensory responses to 
changes in its synapses as animals learn new information”. This tech- 
nique, in combination with techniques to genetically manipulate syn- 
aptic plasticity, should provide a powerful tool to study how connectivity 
and coding develop into a form appropriate for an animal’s experience. 
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Cooperation between brain and islet in 
glucose homeostasis and diabetes 


Michael W. Schwartz!, Randy J. Seeley’, Matthias H. Tschop’, Stephen C. Woods*, Gregory J. Morton’, Martin G. Myers? 
& David D’ Alessio” 


Although a prominent role for the brain in glucose homeostasis was proposed by scientists in the nineteenth century, 
research throughout most of the twentieth century focused on evidence that the function of pancreatic islets is both 
necessary and sufficient to explain glucose homeostasis, and that diabetes results from defects of insulin secretion, action 
or both. However, insulin-independent mechanisms, referred to as ‘glucose effectiveness’, account for roughly 50% of 
overall glucose disposal, and reduced glucose effectiveness also contributes importantly to diabetes pathogenesis. 
Although mechanisms underlying glucose effectiveness are poorly understood, growing evidence suggests that the 
brain can dynamically regulate this process in ways that improve or even normalize glycaemia in rodent models of 
diabetes. Here we present evidence of a brain-centred glucoregulatory system (BCGS) that can lower blood glucose 
levels via both insulin-dependent and -independent mechanisms, and propose a model in which complex and highly 
coordinated interactions between the BCGS and pancreatic islets promote normal glucose homeostasis. Because activation 
of either regulatory system can compensate for failure of the other, defects in both may be required for diabetes to develop. 
Consequently, therapies that target the BCGS in addition to conventional approaches based on enhancing insulin effects 


may have the potential to induce diabetes remission, whereas targeting just one typically does not. 


diabetes (T2D) represents one of the most pressing and costly 

biomedical challenges confronting modern society'’. However, 
much about the pathogenesis of these disorders remains unknown. In 
this article, we review recent evidence for a BCGS that works in tandem 
with pancreatic islets to regulate blood glucose levels. Glucose lowering 
induced by BCGS activation can involve a variety of mechanisms, some 
of which depend on insulin whereas others are altogether independent 
of islet hormones. Although islet- and brain-centred systems are distinct 
entities, evidence suggests that they work cooperatively to maintain stable 
blood glucose levels across a range of homeostatic challenges. Moreover, 
each system seems to have the potential to compensate, at least partially, 
for the failure of the other. Consequently, defects in both systems may be 
required for diabetes to develop and/or progress. This redundancy of islet- 
and brain-centred glucoregulatory systems presumably ensures tight 
regulation of circulating glucose, the body’s principal metabolic currency. 


T he escalating epidemic of obesity, metabolic syndrome and type 2 


Historical perspective 


On the basis of his observation in 1854 that diabetes could be induced in 
rabbits by puncturing the floor of the fourth-cerebral ventricle (‘piqire 
diabetique’)’, the renowned physiologist Claude Bernard proposed a 
role for the brain in both glucose homeostasis and diabetes pathogenesis. 
This notion remained popular until the discovery of insulin in 1921, and 
the subsequent identification of liver, muscle and adipose tissue as prin- 
cipal targets of the powerful effects of insulin on glucose metabolism. 
Combined with evidence linking diabetes pathogenesis to defective insu- 
lin secretion and action’, the pancreatic islet quickly came to overshadow 
the brain as the focal point for understanding this disease (Box 1). 
Current diabetes treatment options reflect this islet-centred view, con- 
sisting principally of recombinant human insulin preparations, insulin 
secretagogues (some of which also inhibit glucagon secretion), and drugs 
that increase insulin sensitivity. These drugs enjoy wide use and are 


effective in controlling hyperglycaemia, the hallmark of T2D, but they 
address the consequences of diabetes more than the underlying causes, 
and thus control rather than cure the disease. 

Although insulin-independent mechanisms contribute nearly as much 
to glucose disposal as insulin does, little is known about how this type of 
glucose lowering works or what its therapeutic potential might be. Recent 
work indicates that BCGS activation can markedly improve glucose 
homeostasis in rodent models of diabetes via largely insulin-independent 
mechanisms’, and the possibility has been raised that a similar mech- 
anism contributes to diabetes remission®’ induced by bariatric surgical 
procedures such as Roux-en-Y gastric bypass*"’. Reconsideration of how 
glucose homeostasis is achieved by the body and the respective roles 
played by islet and brain in this process therefore seems justified. 


Brain control of glucose homeostasis 


A large literature documents glucoregulatory effects of pharmacological 
or genetic interventions targeting neurons in any of several areas of 
the hypothalamus (arcuate, ventromedial and paraventricular hypotha- 
lamic nuclei) and brainstem. Although uncertainty surrounds both the 
molecular identity and the role in glucose homeostasis played by many 
of these neuronal groups’, injection of insulin or glucose into discrete 
hypothalamic areas can lower blood glucose levels and increase liver 
insulin sensitivity’*'®, and similar effects are achieved by restoring func- 
tional leptin receptors to specific hypothalamic nuclei of animals that 
otherwise lack them'”'*. Conversely, deletion of receptors for either insu- 
lin or leptin (or their downstream signalling intermediates) from defined 
hypothalamic neurons causes glucose intolerance and systemic insulin 
resistance, indicating a physiological role for these neurons in the control 
of glucose metabolism’””®. These and many other observations highlight 
how the brain can influence glucose homeostasis in response to afferent 
input from peripheral signals, but they have yet to establish the extent to 


1Diabetes and Obesity Center of Excellence, Department of Medicine, University of Washington, Seattle, Washington 98109, USA. Department of Medicine, University of Cincinnati, Cincinnati, Ohio 45237, 
USA. 2Institute of Diabetes and Obesity, Helmholtz Zentrum Munchen & Division of Metabolic Diseases, Department of Medicine, Technische Universitat Minchen, Munich 85764, Germany. “Department 
of Psychiatry, University of Cincinnati, Cincinnati, Ohio 45237, USA. Department of Physiology, University of Michigan, Ann Arbor, Michigan 48105, USA. 
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BOX | 


which such responses participate in the physiological control of circulat- 
ing glucose levels. 


Indirect control of hepatic glucose production 


Although there is little doubt that insulin regulates hepatic glucose pro- 
duction (HGP) through a direct action on hepatocytes, insulin has also 
been proposed to regulate HGP via an indirect mechanism involving insu- 
lin action at a remote site”'. As aberrant control of HGP is fundamental to 
diabetic hyperglycaemia”, its regulation has important clinical impli- 
cations. The direct action of insulin on hepatocytes and other cell types 
involves its binding to insulin receptors and activation of signal transduc- 
tion cascades that regulate a wide range of cellular processes. Of particular 
relevance to glycaemic control is the canonical insulin receptor substrate- 
phosphatidylinositol-3-OH kinase (IRS-PI(3)K) pathway (Fig. 1), which 
mediates insulin inhibition of both glycogenolysis and gluconeogenesis, 
the two primary determinants of HGP”. 

In the fasted state, when the intestine is not absorbing nutrients and 
insulin levels are low, the liver is the primary source of circulating glucose 
and the rate of HGP is high. After a meal, nutrient-induced insulin secre- 
tion and subsequent activation of hepatic IRS—PI(3)K signalling inhibits 
HGP. The cellular basis for this effect involves PI(3)K-mediated activa- 
tion of Akt, a serine-threonine kinase that, among other actions, inhibits 
the transcription factor FOXO1. FOXOI stimulates gluconeogenesis in 
hepatocytes, and its inhibition is mandatory for insulin suppression of 
HGP". Insulin activation of the canonical IRS-PI(3)K pathway in hepa- 
tocytes is implicated in the control of HGP by insulin under physiological 
conditions, and previous studies”* offer clear evidence in support of this 
hypothesis. 

The concept that HGP can also be controlled by insulin action at a 
remote site was first proposed more than 15 years ago” and received 
compelling support in a recent study” of “TLKO’ mice with hepatocytes 
unresponsive to insulin owing to liver-specific deletion of key signal 
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Box 1 Figure | The traditional, islet-centred model of normal and abnormal glucose homeostasis. a, Under normal conditions, the islet-centred 
model proposes that glucose homeostasis is controlled primarily by the effect of rising blood glucose levels to stimulate insulin secretion. Insulin then 
acts on peripheral tissues such as the liver to suppress hepatic glucose production (HGP), and adipose tissue and muscle to stimulate glucose uptake. 
Not shown is the effect of the islet hormone glucagon, secretion of which is inhibited by rising glucose levels, and which acts to stimulate HGP. Thus, 
glucose has opposing actions on the secretion of insulin and glucagon, hormones that in turn have opposing effects on HGP. When blood glucose 
levels increase (for example, during a meal), therefore, the islet response effectively returns it to baseline. b, When individuals with normal islet function 
become insulin-resistant (for example, in association with dietary and/or genetic factors that cause obesity), the islet-centred model proposes that 
glucose homeostasis is preserved by the capacity of the islet to increase insulin secretion in a compensatory manner. ¢, If islet dysfunction precludes 
the increase of insulin secretion needed to overcome insulin resistance, glucose intolerance results. As islet dysfunction progresses, increased HGP 
and reduced tissue glucose uptake eventually cause overt hyperglycaemia and diabetes. 
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transduction molecules (the two Akt isoforms as well as FOXO1). In 
these animals, insulin cannot directly regulate HGP via the Akt-FOXO1 
pathway. However, rather than exhibiting the expected loss of regulation, 
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Figure 1 | Insulin signal transduction. In hepatocytes, insulin regulates HGP 
by activating the IRS—PI(3)K pathway, which inhibits the transcription factor 
FOXO1. FOXO1 activation increases gluconeogenesis, leading to increased 
hepatic glucose production (HGP). PIP2, phosphatidylinositol-4,5- 
bisphosphate; PIP3, phosphatidylinositol-3,4,5-triphosphate. 
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both HGP and systemic glucose homeostasis are controlled normally in 
these mice, even in response to exogenous insulin. 

These data are among several observations”! that point to the exist- 
ence of an indirect pathway through which insulin and nutrients can 
regulate HGP even when hepatocytes themselves are insensitive to direct 
insulin action. An intriguing question is what mechanism mediates the 
indirect control of HGP by insulin. Although other explanations are 
possible**”’, the BCGS is both activated by insulin and capable of regu- 
lating HGP in humans” as well as rodent models'””*. 

Comparison of the phenotypes of the previously reported TLKO mice 
with liver-specific insulin receptor knockout (LIRKO) mice, in which the 
liver is also unable to respond to insulin”’, is informative. Unlike TLKO 
mice that have generally normal glycaemic regulation, LIRKO mice are 
severely glucose intolerant and insulin resistant. This is because LIRKO 
mice have unrestrained and unregulated HGP because FOXO1 is con- 
stitutively active in the absence of insulin-stimulated Akt activation. In 
LIRKO mice, therefore, increased HGP results from excessive FOXO1 
activity, removal of which enables normal control of HGP via the indirect 
pathway. From this conclusion we infer that although inhibition of HGP 
by the indirect insulin pathway involves a FOXO1-independent mech- 
anism, it can be blocked by excessive FOXO] signalling. 
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Insulin-independent glucose disposal by the BCGS 


Although a large literature has established the brain’s capacity to affect 
glucose homeostasis, it has become clear only recently that this can 
involve mechanisms that are independent of insulin. Studies in which 
leptin was infused directly into brain ventricles—at doses too low to have 
any effect outside the brain—of rats and mice with insulin-deficient 
diabetes clearly demonstrate the ability of central leptin action to nor- 
malize markedly increased blood glucose levels*”*' despite persistent, 
severe insulin deficiency. This surprising outcome is incompatible with 
a strictly islet-centric model of glucose homeostasis. Similar findings 
have been reported in other rodent models of insulin-deficient diabetes 
using systemic (rather than central) administration of leptin at supra- 
physiological doses**”?. 

Normal glucose tolerance, the ability to clear glucose from the blood- 
stream after a systemic glucose load, is believed to involve wide-ranging 
and highly coordinated effects of insulin across many different tissues. 
Thus, it seems surprising that in addition to normalizing fasting plasma 
glucose levels, intracerebroventricular (ICV) leptin infusion also restores 
glucose tolerance to nearly normal levels in rats with uncontrolled insulin- 
deficient diabetes*’. Leptin action in the brain can therefore orchestrate 
complex and interconnected processes across several tissues to lower 
blood glucose despite the absence of insulin signalling. Although mecha- 
nisms mediating this effect are still under investigation, normalization of 
HGP, along with increased glucose uptake in tissues such as skeletal 
muscle, heart and brown adipose tissue, have a role”’. 

If activation of the BCGS by exogenous leptin is sufficient to correct 
diabetes without the need for insulin, why does severe insulin deficiency 
cause uncontrolled diabetes if the BCGS is left undamaged? The answer 
may lie in the extensive overlap between peripheral and central glucor- 
egulatory systems. Insulin is required for the proper functioning of many 
cells and organ systems, including adipose tissue, and states of severe 
insulin deficiency undermine the ability of adipocytes both to store cal- 
ories as fat and to secrete leptin. Consequently, severe insulin deficiency 
begets severe leptin deficiency”, depriving the BCGS of two key inputs 
and thereby undermining its function. Importantly, this defect is revers- 
ible by activating leptin receptors exclusively in the brain, as evidenced 
by the ability of central leptin infusion to restore normal glucose home- 
ostasis to animals with uncontrolled, insulin-deficient diabetes*®. Of course, 
uncontrolled diabetes can also be reversed by systemic insulin treatment, 
but this normalizes plasma levels of leptin as well as insulin**. Restoring 
normal leptin levels is important, because in the absence ofa leptin signal 
(for example, in lipodystrophy or other leptin-deficient conditions), con- 
trol of hyperglycaemia is much more difficult than in other forms of 
diabetes”. 
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Although physiological leptin replacement blocks or attenuates many 
neuroendocrine responses induced by insulin-deficient diabetes, it does 
not normalize hyperglycaemia**. This finding suggests that in the absence 
of insulin, supraphysiological activation of the BCGS is necessary to 
restore euglycaemia, and delivery of leptin to the brain in supraphysio- 
logical amounts achieves this effect. Thus, just as compensation for leptin 
deficiency requires insulin concentrations well above the normal physio- 
logical range (for example, in lipodystrophy or ob/ob mice, in which 
diabetes develops despite profound hyperinsulinaemia), high leptin levels 
are required to compensate for severe insulin deficiency. Stated differ- 
ently, although islet- and brain-centred control systems are each able to 
compensate for the failure of each other, the activity of either system must 
be amplified for full compensation to occur. Unfortunately, islet failure 
does not trigger compensatory BCGS activation, but rather has the 
opposite effect, leading to a vicious cycle that ends in hyperglycaemia. 

The use of the term ‘insulin independent to refer to actions mediated 
by the BCGS that become dysfunctional in the face of islet failure is a 
potential source of confusion. This is because if we accept that normal 
operation of the BCGS (including production of leptin by adipocytes) 
depends on insulin, one can argue that the entirety of glucose home- 
ostasis is ‘insulin dependent’, even those effects mediated by the BCGS 
that do not involve a direct effect of insulin to stimulate glucose uptake. 
To avoid this confusion, we use ‘insulin independent’ hereafter to 
refer to effects on tissue glucose metabolism that do not involve direct, 
insulin-mediated signal transduction. 

Recent evidence indicates that hormones other than leptin can also 
act in the brain to promote insulin-independent glucose lowering. Like 
insulin, the gastrointestinal hormone FGF19 (or its rodent homologue, 
FGF15) is secreted in response to meals, and, when given at pharmaco- 
logical doses, exerts potent anti-diabetic effects*’. Glucose-lowering by 
FGF19 involves actions in liver and adipose tissue, but the brain is also 
implicated, as ICV administration of FGF19 improves glucose tolerance 
in obese rats**. To investigate the mechanism underlying centrally 
mediated glucose lowering by FGF19, a study was recently performed 
in genetically obese, leptin-deficient ob/ob mice’. Within 2h of a single 
ICV injection of FGF19 (at a dose causing no glucose lowering when 
given peripherally), ob/ob mice displayed markedly improved glucose 
tolerance, despite no change in insulin secretion or sensitivity’. Instead, 
the glucose-lowering effect of ICV FGF19 resulted from a selective, 
threefold increase in the insulin-independent component of glucose 
disposal. In response to diverse hormonal stimuli, therefore, the brain 
has the inherent capacity to remedy diabetic hyperglycaemia and glu- 
cose intolerance via potent, insulin-independent mechanisms’, as well 
as through enhanced insulin sensitivity’? ”°. 


Glucose effectiveness 


The term glucose effectiveness (GE) refers to the effect of an increased 
concentration of glucose to promote its own disposal, independent of 
insulin action”. Insulin-independent glucose disposal also occurs at basal 
glucose levels, but our understanding of the underlying mechanisms is 
insufficiently advanced to know whether the same or distinct processes 
contribute when plasma glucose levels are high versus in the basal state. In 
accord with convention, therefore, we use the term ‘insulin-independent 
glucose disposal’ to refer to the overall process, including those that oper- 
ate at basal glucose levels, and reserve the use of ‘GE’ to refer to insulin- 
independent glucose disposal when blood glucose levels are increased. 
A key point is that insulin-independent glucose disposal makes a large 
contribution to overall glucose homeostasis, roughly comparable to that 
of insulin®. Combined with the fact that reduced GE is both a major 
contributor to obesity-associated glucose intolerance*”** and a strong 
risk factor for the future development of T2D (ref. 40), it is surprising 
how little is known about it. Unlike the dynamic and physiologically 
important regulation that characterizes insulin secretion and action, 
insulin-independent glucose disposal has traditionally been viewed as 
the fixed and unregulated process through which insulin-independent 
tissues obtain glucose to meet their needs*”*'. The mechanism typically 
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invoked to explain insulin-independent glucose disposal involves the 
passive effect of an increased glucose level to drive its movement down a 
concentration gradient and into cells (termed glucose mass action), but it 
is now clear that other mechanisms also exist—mechanisms that are 
subject to rapid regulation and can profoundly affect glucose homeostasis. 

Perhaps the best-documented and most obvious example of rapid 
regulation of insulin-independent glucose disposal is in response to 
physical exercise, with the heightened metabolic demands of exercising 
muscle stimulating glucose uptake in the presence of stable ambient 
insulin and glucose levels. In addition to exercise, rapid regulation of 
GE has been reported in response to hormonal stimulation, for example, 
during intravenous infusion of glucagon-like-peptide-1 (GLP-1). Although 
GLP-1 improves glucose tolerance by enhancing insulin secretion, it also 
increases GE via mechanisms that have yet to be studied”. Interestingly, 
GLP-1 action in the hypothalamic arcuate nucleus also improves glucose 
tolerance®’, raising the untested possibility that its effects on GE (like 
those of leptin and FGF19) are centrally mediated. 

Extending this reasoning, it is noteworthy that, by definition, GE increases 
in response to rising blood glucose levels, and that glucose action on arcuate 
nucleus neurons has a rapid glucose-lowering effect'®. Collectively, these 
observations support a model in which, by increasing plasma concentra- 
tions of insulin, GLP-1, FGF19 and glucose, consuming a meal generates 
diverse signals that activate the BCGS. This BCGS activation then con- 
tributes to glucose disposal via stimulation of both insulin-dependent and 
-independent mechanisms that, together with islet responses, are essen- 
tial for proper glucose handling by the body (Fig. 2). 

If insulin-independent glucose disposal is subject to rapid and potent 
regulation by the brain, it is not clear why neural control of GE has not 
been detected previously. One explanation may be that previous studies 
have relied on methods that are not optimized to detect GE. Chief among 
these is the euglyaemic—hyperinsulinaemic clamp method, considered by 


Islet-centred 
control system 


Pancreas Gl tract 


many to be the gold standard for quantitative, in vivo assessment of glucose 
metabolism. With this method, insulin sensitivity is measured as the amount 
of exogenous glucose that must be infused to maintain stable (or ‘clamped’) 
blood glucose concentrations when insulin levels are raised. Conse- 
quently, experimental interventions that change the amount of glucose 
required during the clamp are interpreted as having changed insulin 
sensitivity, despite the fact that some of the infused glucose could have 
been disposed of by insulin-independent mechanisms. Thus, one cannot 
know with certainty the extent to which observations based on the clamp 
method are due to changes in insulin-independent glucose disposal 
instead of, or in addition to, changes of insulin sensitivity. This limitation 
can be addressed using a complementary approach based on minimal 
model analysis of glucose and insulin kinetics during an intravenous 
glucose tolerance test. This method has seen broad use in clinical 
research*”*** and was recently used to reveal the potent stimulatory 
effect of centrally infused FGF19 on GE in ob/ob mice’. 


A physiological role for the BCGS 

Although there is little question that the brain participates in the glu- 
coregulatory response to emergent or stressful conditions (for example, 
hypoglycaemia), the notion that the BCGS acts together with the islet to 
control glucose homeostasis under physiological conditions has yet to 
gain broad acceptance. A common and appropriate criticism is that 
although brain-directed interventions can affect glucose homeostasis, 
this should not be taken as evidence that the brain has a physiological 
role. Although the question of whether the BCGS is vital for normal, 
day-to-day control of blood glucose levels remains unanswered, several 
recent observations—that an indirect pathway controlling HGP exists 
and that this pathway can support normal glucose homeostasis even 
when the liver cannot respond to insulin directly’, that BCGS activation 
can be rapidly and potently engaged to increase insulin-independent 
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Figure 2 | Schematic illustrations of brain- and islet-centred 

glucoregulatory systems. The BCGS is proposed to regulate tissue glucose 
metabolism and plasma glucose levels via mechanisms that are both insulin 
dependent (for example, by regulating tissue insulin sensitivity) and insulin 
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independent. Because of extensive redundancy between islet- and brain- 
centred pathways, dysfunction of both may be required for T2D to develop, and 
diabetes remission may be possible with therapies that target both pathways. 
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glucose disposal, and that doing so has the potential to treat diabetic 
hyperglycaemia—justify reconsideration of this hypothesis and call for 
studies that offer a definitive test. 

Among factors that engender scepticism of the part played by the 
BCGS is the notion that an islet-centred model (Box 1) is sufficient to 
explain the physiological control of glucose homeostasis under usual 
circumstances, because the capacity of the islet to secrete insulin in 
response to rising glucose levels often compensates for centrally mediated 
effects. For example, although even subtle disruption of the BCGS (for 
example, deletion of insulin receptors from a distinct subset of hypotha- 
lamic neurons in mice’’”°) can reduce liver insulin sensitivity, the effect 
on glucose homeostasis is minimal because the tendency for blood glu- 
cose levels to rise is offset by a compensatory increase of insulin secretion. 
However, the logic of this argument weakens if BCGS dysfunction is 
more advanced. As an example, leptin-deficient states such as lipodystro- 
phy increase both HGP and blood glucose levels despite marked hyper- 
insulinaemia”. Thus, impairments of both BCGS and islet function exist 
along a spectrum that ranges from mild to severe and, although the 
capacity of the islet to compensate for BCGS impairment is substantial, 
it has its limits. Normal BCGS function can therefore be seen as being 
permissive for normal glucose homeostasis, with islet compensation lim- 
iting the effect of BCGS dysfunction when it is mild, but not when it is 
more advanced. 

A paucity of mechanistic information is another factor that has lim- 
ited acceptance of a physiological role for the BCGS in glucose home- 
ostasis. Whereas a great deal is known about how cellular insulin action 
affects glucose metabolism in hepatocytes (Fig. 1), for example, much 
less is known about how the brain controls HGP. Although a role for 
hepatic vagal innervation has been suggested'*""’, it is premature to 
invoke the vagus nerve as the predominant mediator of this effect*’. 
Another mechanism that may be relevant involves the islet hormone 
glucagon, which stimulates hepatic gluconeogenesis and gycogenolysis 
and hence raises HGP. Increased glucagon levels are implicated in the 
hyperglycaemia of uncontrolled, insulin-deficient diabetes, because 
both HGP and plasma glucagon levels are raised in this setting. 

In this context, the interaction between leptin and glucagon is of 
interest. First, because leptin normalizes both HGP and increased glu- 
cagon levels in rodents with uncontrolled diabetes”, leptin-mediated 
inhibition of HGP may involve normalization of increased glucagon 
levels. Interestingly, leptin-mediated inhibition of glucagon secretion 
seems to be centrally mediated, because the effect is observed regardless 
of whether leptin is given systemically (at a high dose) or by ICV injec- 
tion (at a low dose)*°*’. Furthermore, the effect of uncontrolled diabetes 
to increase both glucagon secretion and HGP seems to be triggered, at 
least in part, by leptin deficiency, because both are reversed by leptin 
treatment*”****. However, increased plasma glucagon levels were normal- 
ized by systemic administration of a physiological dose of leptin to rats 
with streptozotocin-induced diabetes mellitus, and yet hyperglycaemia 
did not substantially improve’’. Thus, the extent to which leptin-mediated 
normalization of circulating glucagon levels mediates its glucose-lowering 
effects in this setting awaits further study. 

Further insight into the physiological role the BCGS has in glucose 
homeostasis can be gleaned from the hepatic response to a nutrient 
challenge. After a meal (or in response to a glucose load), the liver switches 
from being a net producer to a net consumer of glucose, and a surprisingly 
large fraction of the glucose absorbed during a meal is taken up into the 
liver*’. This response is triggered by rising glucose concentrations in the 
hepatic portal vein (the vessel into which ingested nutrients enter before 
gaining access to the systemic circulation), which seems to be sensed by the 
BCGS". Activation of the BCGS in turn strongly enhances liver glucose 
uptake via a mechanism that is augmented by insulin action in the brain”. 

Several key questions remain to be addressed. One is whether the regu- 
lated component of insulin-independent glucose disposal is required for 
normal glucose homeostasis (a possibility that seems likely, given its 
considerable involvement), and if so, another is whether intact BCGS 
function is required for normal GE. Affirmative answers to both questions 
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would constitute indisputable evidence that the brain has a physiological 
role in glucose homeostasis—perhaps comparable to that played by the 
islet, which itself is subject to regulation by the brain’? **"*. 


Two-system control of glucose homeostasis 


On the basis of the above reasoning, we propose that in response to a 
meal, both islet- and brain-centred systems are engaged and have import- 
ant roles to restore homeostasis (Box 2). As ingested nutrients are 
absorbed into the circulation, increased insulin secretion and its canon- 
ical action on muscle, fat and liver both promote glucose disposal and 
inhibit its endogenous production. At the same time, the recruitment 
of insulin-independent mechanisms, in part through BCGS activation, 
makes a contribution to the overall process comparable to that of insulin. 
Like the action of insulin, these insulin-independent effects serve to both 
enhance glucose disposal (for example, through increased liver glucose 
uptake) and inhibit glucose production. 

After a meal, the contributions made by insulin-dependent and 
-independent mechanisms to the overall process are roughly equal, 
reflecting a partnership between direct, peripheral tissue effects of insulin 
and BCGS activation that ensures the efficient return of increased plasma 
glucose levels to basal values (Fig. 2). This two-system model incorporates 
interactions between the BCGS and islet-based systems into physiological 
glucose homeostasis via coordinate regulation of insulin-dependent and 
-independent mechanisms. 


Is diabetes a failure of two systems? 


In addition to establishing that the brain can potently increase GE, the 
observation that ICV leptin administration normalizes hyperglycaemia 
in rodents with uncontrolled diabetes indicates that BCGS activation 
can compensate effectively for severe insulin deficiency. This conclusion 
in turn suggests that disorders of both islet- and brain-centred systems 
may be necessary for T2D to occur (Fig. 3). This hypothesis is compat- 
ible with the observation that loss of canonical insulin action in specific 
tissues (for example, liver) has little effect on glucose homeostasis”, and 
that reduced GE contributes importantly to hyperglycaemia in T2D 
(ref. 39). But what is the evidence that BCGS function is impaired in 
individuals with diabetes? To our knowledge, there are no established 
examples in which diabetes occurs in the absence of BCGS dysfunction. 
Diabetes and BCGS dysfunction are tightly coupled to one another 
because (1) proper BCGS function depends on normal islet function, 
relying on inputs from insulin as well as other hormones whose secre- 
tion is either dependent on islet function (for example, leptin) or defec- 
tive in diabetes (for example, GLP-1), and (2) rodent models of obesity 
and T2D are associated with hypothalamic injury and gliosis, a poten- 
tially important cause of BCGS dysfunction’”*'. These hypothalamic 
alterations are proposed to reduce the ability of the BCGS to respond to 
relevant humoral signals (including insulin as well as leptin), and hence 
contribute to the associated fall of GE and onset of systemic insulin 
resistance that places an increased demand on islets in the lead up to 
T2D (Fig. 3). Whether this form of hypothalamic injury also occurs in 
human hypothalamus is under investigation, and early data support this 
possibility’”°*. Thus, hypothalamic injury or inflammation offers a 
plausible mechanism linking impairment of the BCGS to T2D patho- 
genesis, and studies to test this hypothesis critically are warranted. 


Prospects for diabetes remission 

Beyond causing weight loss, bariatric surgery induces diabetes remission 
ina far higher percentage of cases than can be achieved with conventional 
medical therapy®”"°. The mechanism underlying metabolic benefit con- 
ferred by bariatric procedures is incompletely understood but may involve 
improvements of both islet- and brain-centred glucoregulatory systems. 
A previous study in a model of bariatric surgery (‘duodenal exclusion’) 
showed that blood glucose levels could be normalized in diabetic rats via 
insulin-independent activation of a neural circuit that inhibits HGP*. 
Using a similar surgical model, another study’? demonstrated that regu- 
lation of HGP after this procedure requires neuronal glucose sensing in 
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diabetes remission, whereas targeting just one system typically does not. 


Box 2 Figure | Model integrating the BCGS and islet-centred system in normal and abnormal glucose homeostasis. a, Under normal conditions, 
glucose homeostasis is controlled by complex and highly coordinated interactions between 
senses a variety of humoral signals, and in response to these inputs, BCGS activation increases glucose disposal by both insulin-dependent (for 
example, by increasing tissue insulin sensitivity) and insulin-independent (by increasing GE, which accounts for ~50% of overall glucose disposal%?) 
mechanisms. b, Although insulin normally inhibits HGP through its direct action on the liver, an indirect pathway also exists through which insulin can 
preserve normal HGP and blood glucose levels even when hepatocytes cannot respond to insulin directly°. We propose that this is among the effects 
mediated by the BCGS.c, Obesity is associated with reduced GE? and with insulin resistance, and BCGS dysfunction contributes to both. When BCGS 
dysfunction is mild, the resulting tendency for blood glucose levels to increase stimulates insulin secretion, such that glucose homeostasis is 
preserved (at the expense of higher insulin levels). When BCGS dysfunction is more severe, however, even marked hyperinsulinaemia cannot preserve 
normal glucose homeostasis*°, owing in part to the inability of reduced GE to be compensated by increased insulin secretion. Thus, intact BCGS 
function is required for normal glucose homeostasis. d, Islet dysfunction is not compensated by BCGS activation; to the contrary, impaired islet 
function can itself impair BCGS function (by reducing secretion of leptin as well insulin, when islet damage is severe) creating a vicious cycle that 
results initially in glucose intolerance. As both BCGS and islet dysfunction progress, overt hyperglycaemia and T2D result. e, Islet dysfunction can be 
compensated for by supraphysiological BCGS activation, which can achieve near-normal glucose homeostasis in rodent models of diabetes via 
insulin-independent mechanisms. Thus, therapeutic interventions targeting the BCGS as well as the traditional islet-based system may achieve 
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the hepatic portal bed, and recent work indicates that despite having no 
effect on weight loss, body composition, food intake or energy expend- 
iture™’, sub-diaphragmatic vagotomy blocked the effect of bariatric sur- 
gery to reduce HGP in a rat model of obesity’. Furthermore, recent work 
suggests that insulin signalling in the ventromedial hypothalamus is 
required for the effect of bariatric surgery to inhibit HGP in an obese 
rat model**. Although mechanisms underlying BCGS activation by bar- 
iatric surgery await further study, recent evidence offers a link between 
enhanced secretion of FGF19, the nervous system and the gastrointestinal 


tract**. The larger point is that, should metabolic benefit arising from 
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bariatric procedures be shown to involve BCGS activation, this would 
in turn suggest that diabetes remission may be achievable through inter- 
ventions that activate both islet- and brain-centred glucoregulatory sys- 
tems, whereas targeting just one does not. In principle, achieving this goal 
should not require surgical manipulation of the gastrointestinal tract. 


Conclusion 

When Claude Bernard proposed a dominant role for the brain in glucose 
homeostasis and diabetes pathogenesis, it was not the radical notion that 
it seems to be today. After all, the brain is implicated in the homeostatic 
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point is that drugs that target the BCGS have important potential to 
synergize with current, islet-based approaches in ways that may fun- 
damentally improve the management of what is among the most com- 
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Figure 3 | Proposed contributions of defective brain- and islet-centred 
glucoregulatory systems to T2D pathogenesis. The traditional view holds 
that diabetes arises as a consequence of damage to, and ultimately failure of, 
beta-cell function. We propose a two-component model in which failure of 
glucose homeostasis can begin after initial impairment of either pancreatic 
islets or the BCGS. Malfunction of either of the two systems can initiate a 
cascade that drives the remaining glucoregulatory system into failure over time. 
Only when both systems are compromised does diabetes develop. 
Consequently, interventions that target both systems have greater therapeutic 
potential than those that target just one system. 


control of most physiological processes that are essential for survival, 
ranging from body fuel stores (for example, fat mass) and body temper- 
ature to blood pressure and many endocrine systems. From this per- 
spective, it seems surprising that control of glucose homeostasis should 
be governed entirely by peripheral mechanisms, despite some 90 years of 
research that has focused more or less exclusively on this view. One 
wonders how this area of science would have developed if leptin and 
its ability to normalize glucose levels in uncontrolled diabetes had been 
discovered in 1921, rather than insulin. 

The surprise with which recent demonstrations of brain-mediated, 
insulin-independent correction of diabetes have been greeted brings 
into bold relief how, in the years since the discovery of insulin, metabolic 
research has been focused on only one part of the system governing 
glucose homeostasis —the part involving pancreatic islets. Despite hav- 
ing a role that may be comparable to that of insulin, insulin-independent 
glucose disposal has until now been seen as phenomenological and less 
worthy of study, and the notion that it might be regulated by the brain 
has not been previously considered. 

Looking to the future, there are several important fundamental ques- 
tions to address. Before the broader scientific community can (or should) 
be expected to embrace a role for the brain comparable to that of the islet 
in the day-to-day control of blood glucose levels, studies are needed to 
determine whether the maintenance of normal GE, which is known to be 
required for normal glucose tolerance, is dependent on a properly func- 
tioning BCGS. A related and equally important question is whether the 
link between reduced GE and the development of T2D (ref. 40) is 
explained by BCGS dysfunction. Such a finding would offer direct evid- 
ence that failure of both the BCGS and the islet is integral to diabetes 
pathogenesis. 

Lastly, the observation that hormones such as FGF19 can act in the 
brain to improve glucose homeostasis in animal models of diabetes 
identifies new avenues for diabetes drug development. To expand on 
this specific example, the mechanism underlying the central action of 
FGF19 is proposed to involve a specific FGF receptor subtype, FGFRIc, 
that is widely expressed in the brain. In principle, there is no reason why 
synthetic agonists of this receptor should not prove effective for glucose 
lowering in patients with diabetes. Indeed, the efficacy of such drugs 
may not rely entirely on their central action, because activating this 
receptor in peripheral tissues seems to also be beneficial*’. The larger 
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The effect of anthropogenic aerosols on cloud droplet concentrations and radiative properties is the source of one of the 
largest uncertainties in the radiative forcing of climate over the industrial period. This uncertainty affects our ability to 
estimate how sensitive the climate is to greenhouse gas emissions. Here we perform a sensitivity analysis on a global 
model to quantify the uncertainty in cloud radiative forcing over the industrial period caused by uncertainties in aerosol 
emissions and processes. Our results show that 45 per cent of the variance of aerosol forcing since about 1750 arises from 
uncertainties in natural emissions of volcanic sulphur dioxide, marine dimethylsulphide, biogenic volatile organic carbon, 
biomass burning and sea spray. Only 34 per cent of the variance is associated with anthropogenic emissions. The results 
point to the importance of understanding pristine pre-industrial-like environments, with natural aerosols only, and 
suggest that improved measurements and evaluation of simulated aerosols in polluted present-day conditions will not 
necessarily result in commensurate reductions in the uncertainty of forcing estimates. 


The impact of aerosol changes on cloud albedo (called the aerosol first 
indirect forcing)' is estimated’ to exert a global mean radiative forc- 
ing of climate over the industrial period between —0.4Wm ~ and 
—1.8Wm 7. Other aerosol-cloud interaction effects, involving rapid 
adjustments, may be of comparable magnitude* but their radiative 
effects are even less well understood on a global scale**. The uncer- 
tainty in the aerosol forcing is much larger than the uncertainty in the 
well-constrained positive forcing of 1.7 + 0.2 Wm ~ that is due to 
carbon dioxide change. The aerosol indirect forcing therefore has a 
highly uncertain influence on climate change and has the potential to 
mask a significant portion of greenhouse gas warming’. 

The magnitude of the forcing caused by aerosol-cloud interactions 
depends on several poorly modelled aspects of the climate, but is broadly 
understood to stem from different treatments of aerosols, clouds and 
radiation***. Nevertheless, the fundamental driver is the change in 
aerosols from the pre-industrial period to the present day, which 
controls the change in cloud droplet concentrations. It is recognized 
that quantification of aerosol indirect forcing requires an understand- 
ing of both the pre-industrial aerosol state’’* and the effect of the 
substantial anthropogenic perturbation. However, because of the 
complexity of processes that determine cloud-forming aerosol con- 
centrations’ and the computational expense of global aerosol models 
which explicitly simulate their production and loss processes'*""*, a 
comprehensive assessment of the magnitude and leading causes of 
uncertainty in indirect forcing has not been attempted. 


Perturbed parameter simulations 


Here we carry out a variance-based sensitivity analysis of a global aerosol 
model to attribute the uncertainty in the aerosol first indirect forcing to 
uncertainties in the emissions and processes that control changes in 
aerosol over the industrial period. We perform an ensemble of perturbed 
parameter global aerosol microphysical model simulations using present- 
day (PD) and pre-industrial (PI) emissions (PD is defined as the year 
2000 and PI is defined” as 1750, with additional PI simulations to test 
the effect of using alternative reference years of 1850 or 1900). The 168 


1-year model simulations in the PI and PD periods cover the full 
expert-elicited uncertainty space of 28 parameters describing natural 
and anthropogenic aerosol emissions, aerosol precursor gas emissions, 
microphysical processes and structures of the aerosol model (see Methods). 
To characterize fully the magnitude and causes of model uncertainty 
we use Bayesian emulators conditioned on the ensemble data to gene- 
rate continuous model output across the parameter space’*'’. This 
approach enables a Monte Carlo sampling of the model uncertainty 
space” so that a full variance-based sensitivity analysis of the model 
outputs can be performed. 


Radiative forcing uncertainty 


Figure 1 shows the annual mean first indirect radiative forcing and the 
associated 1o uncertainty when assuming the 1750 reference state. 
The global annual mean indirect forcing is —1.16 Wm * (6 =0.22Wm ~, 
95% confidence interval —0.7W m 7 to —1.6W m ”), compared to the 
multi-model range reported in ref. 2 of —0.4Wm * to —1.8Wm 7 
(best estimate, —0.7 Wm *) and an estimate (—0.6 + 0.4Wm *) based 
on assimilated PD aerosol optical depth’. Our estimated 95% confid- 
ence interval is slightly narrower than the multi-model ensemble 
range, most probably because the latter includes structural differences 
in the host model aerosol, cloud and radiation schemes”**. Nevertheless, 
improved understanding of the aerosol processes and emissions would 
clearly help to reduce uncertainty in model forcing calculations. 

The seasonal variation of global mean forcing and the contribu- 
tions of different parameters to the uncertainty are shown in Fig. 2a 
and b (see the Methods and Extended Data Table 3 for the definition 
and elicited range of each parameter). The eight most important para- 
meters account for 92% of the forcing variance: volcanic SO, emissions, 
anthropogenic SO, emissions, dimethyl sulphide (DMS) emissions from 
marine biota, the width of the accumulation mode, dry deposition of 
accumulation mode aerosol, sub-grid sulphate particle formation, the 
width of the Aitken mode, and the diameter of emitted fossil fuel com- 
bustion particles. Several parameters that have a large effect on uncer- 
tainties in PD concentrations of cloud condensation nuclei (CCN)'* 
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Figure 1 | The global distribution of annual mean aerosol first indirect 
forcing and associated uncertainty. a, First indirect forcing; b, standard 
deviation o of forcing. The maps were computed from a Monte Carlo sampling 
of an emulator of forcing in each grid cell of the model. 


have a small effect on forcing uncertainty, which is related to the way 
they perturb PI and PD CCN, as we discuss below. 

A striking aspect of the results is the large contribution to the global 
mean forcing uncertainty from emissions of natural aerosol and pre- 
cursor gases. Together, emissions of volcanic SO, marine DMS, biogenic 
volatile organic carbon (forming secondary organic aerosol), biomass 
burning and sea spray account for 45% of the global annual mean 
forcing variance (Fig. 2b). This compares with 34% of the annual mean 
forcing variance that is due to the eight parameters associated with 
anthropogenic fossil fuel, biofuel, SO, and sulphate particle emissions. 
The biomass burning emissions were perturbed as a single parameter 
and not separated into natural wildfires and anthropogenic biomass 
burning. However, the annual mean 2% contribution of biomass burn- 
ing to the forcing variance means that the natural-anthropogenic split 
is not important to our overall conclusions. Moreover, the seasonality 
of the uncertainty caused by biomass burning suggests that it can be 
attributed mostly to northern mid-latitude emissions associated with 
natural fires (see Methods). 

The relative contribution of different parameters to the uncertainty 
depends on the sampled range in the ensemble (Extended Data Table 3). 
The range for DMS (—50%/+ 100%) is consistent with assessments of 
multiple emission parameterizations” and the same range for volcanic 
SO, is plausible given the uncertainty in sources”. However, our assumed 
range of —40%/+50% for the main anthropogenic aerosol uncertainty 
(SO, emissions) is high compared to the most recent inventories. Thus, 
it is likely that our estimate of the natural aerosol effect on forcing uncer- 
tainty is an underestimate. 
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It is important to note that most of the natural emissions do not, 
by themselves, cause a forcing over the industrial period because the 
emission source strengths were defined to be the same in the PI and 
PD simulations (except for biomass burning); that is, a high setting of 
the emissions in the PI simulation was paired with the same high set- 
ting in the PD. However, natural emissions affect the uncertainty in 
the aerosol first indirect forcing because they affect the background 
aerosol state upon which the forcing is calculated (see below). 

Because our variance-based approach considers parameter inter- 
actions, we are able to establish that the large contribution of natural 
aerosol emissions to forcing uncertainty is not strongly dependent on 
the magnitude of the other parameters. This is important information 
because, for example, the large sensitivity of forcing to natural sulphur 
emissions could be overemphasized if particle formation rates due to 
sulphuric-acid-driven nucleation were too high in the model. However, 
such parameter interactions can be quantified as the difference between 
the sum of the main effect variances (coloured bars in Fig. 2b) and the 
total variance (100% in Fig. 2b); see the Methods. Interactions generally 
account for less than 10% of the total forcing variance, demonstrating 
that the ranked uncertainty results are robust to uncertainties in the 
model set-up. 

The effect of some parameters on forcing uncertainty could be under- 
estimated if the parameter varied in an unknown way between the Pl and 
PDeras, which we have not accounted for here. For example, if DMS or 
volcanic emissions were at the low end of the uncertainty range in the 
Pl and at the higher end in the PD then the resulting increase in sulphate 
aerosol over this period would constitute an additional uncertainty in 
the forcing”. It is plausible that natural emissions change over time, 
implying that the uncertainty attributable to these parameters could be 
underestimated. Whether other parameters behave in this way depends 
on the extent to which the model processes represent an absolute 
understanding or whether they have been inadvertently tuned to con- 
ditions in the PD atmosphere. 


Alternative reference years 


The contribution of natural emissions to the forcing uncertainty will 
depend on the reference year that is used. The 1750 reference, used 
here, is commonly assumed to represent a pristine PI state, whereas 
early industrial decades from the 1850s onwards have also been used”. 
To test the effect of using alternative years for forcing, we repeated our 
calculations for the periods 1850-2000, 1850-1980 and 1900-2000 
(limited to June to reduce computational cost). For the alternative refe- 
rence years we used the same natural emissions as in 1750, but different 
anthropogenic emissions (see Methods). As expected, the indirect for- 
cing is lower when a slightly polluted reference year is used (a June 
mean of —1.30Wm ” for 1850 and —0.96 Wm 7 for 1900, versus 
—142Wm ° for 1750). The uncertainty analysis shows that the 
standard deviation of forcing is slightly larger when the reference year 
is 1850 (6 = 21% of mean) than for 1750 (6 = 19% of mean); see 
Extended Data Table 4. However, the uncertainties in 1850 emissions 
are likely to be larger than for the year 2000, which we have not 
attempted to account for, so we expect our estimate of 1850-2000 for- 
cing uncertainty to be an underestimate. The contribution of anthro- 
pogenic emissions to the forcing uncertainty is also greater using an 
1850 reference (46% of variance, versus 38% using 1750). This change 
compared to 1750 is mainly caused by the increased contribution from 
fossil fuel and sulphate particle emissions. These results show that 
natural emissions remain a substantial part of the forcing uncertainty 
even when slightly polluted reference years are used. They also confirm 
that the uncertainty in forcing is strongly sensitive to the assumed PI 
emissions, whether natural or anthropogenic: the large absolute change 
in anthropogenic emissions between 1980 and 2000 causes hardly any 
change in the contribution of anthropogenic emissions to the uncer- 
tainty in forcing referenced to 1850 (47% versus 46% of variance; see 
Extended Data Table 4), but the small absolute change in emissions 
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Figure 2 | Magnitude and sources of uncertainty in global mean aerosol first 
indirect forcing. a, Global mean forcing and two-standard-deviation 
uncertainty range. b, Seasonal cycle of the contribution of different groups of 
parameters to global monthly mean forcing variance (green, natural emissions; 
pink, anthropogenic emissions; blue, aerosol processes). The difference 


between 1750 and 1850 causes a large change in the anthropogenic 
contribution (38% to 46%). 

Although not a focus of this work, our results also suggest that most 
of the aerosol-cloud forcing has probably been realized by 1980 and 
has changed little between 1980 and 2000 (comparing 1850-1980 and 
1850-2000 time periods; see Extended Data Table 4). This, combined 
with the uncertainty analysis, indicates that it may be possible to place 
a relatively tight constraint on the aerosol forcing over recent decades 
(compared to the PI-to-PD period), which would help determine the 
forcing contribution to the reduction in warming trend. 


Importance of natural aerosols 


The large contribution of natural aerosol emissions to forcing uncer- 
tainty is caused by three factors (Fig. 3a—d). First, the lo response of 
CCN to changing DMS emissions is 44% higher in the PI period than 
in the PD period (¢ = 12.9 cm ° versus 9.0cm™ *) between 60° S and 
60° N in July, caused by the more efficient nucleation of new particles 
in the cleaner PI atmosphere. Second, cloud droplet number concen- 
trations increase more steeply with CCN when concentrations are low 
in the PI era. This effect occurs because high droplet concentrations 
limit the in-cloud supersaturation. Third, cloud albedo sensitivity to 
changes in cloud droplet concentrations (the susceptibility’) is higher 
in the Pl era because the dependence of albedo A on droplet number N 
is approximately dA/dN = A(1 — A)/3N. The combined effect of these 
three factors means that the forcing uncertainty is more strongly affected 
by natural aerosol (which affects PI cloud albedo in a near-linear way) 
than PD anthropogenic aerosol (which affects PD albedo in a sub-linear 
way). For the region of high forcing off the coast of Chile, CCN con- 
centrations rise in our model by a factor of 6.6 (75 cm’ * in the PI period 
to 500 cm’ * in the PD). Under these conditions, and with a cloud albedo 
of 0.5, the sensitivity of albedo to CCN is about 15 times higher in the PI 
period than in the PD. 
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between the sum of variances and 100% is accounted for by parameter 
interactions. c, Global annual mean forcing uncertainty range due to specific 
parameters and groups of parameters. The box shows the interquartile range 
and the whisker shows the 9%-91% confidence interval. The definition of each 
parameter is given in Extended Data Table 3. 


Although natural aerosol emissions cause only a small uncertainty 
in PI and PD CCN, they can have a disproportionate effect on forcing 
uncertainty compared to some process parameters, which we illustrate 
for typical conditions in Fig. 3e. For example, aerosol deposition processes 
were identified as a major uncertainty in PD CCN’. However, uncer- 
tainty in such a process causes an uncertainty in CCN that is in propor- 
tion to the aerosol abundance (so the absolute error in CCN will be higher 
in the PD than in the PI). In contrast, the uncertainty in DMS emissions 
causes approximately the same absolute error in CCN in the PI and PD. 
As shown in Fig. 3e, the relationship between CCN, cloud droplet con- 
centrations and albedo means that there is some degree of cancellation of 
errors in the case where the CCN error is proportional to the amount of 
aerosol. For the illustrative conditions used in Fig. 3e, when PD CCN 
concentrations are about double the PI levels, the uncertainty in forcing is 
a factor of ten higher in the absolute case than in the proportional case. 


Implications 

Our study provides the first assessment of how aerosol processes and 
emissions affect the uncertainty in indirect forcing between the PI and 
PD periods and provides quantitative support for previous studies 
that have highlighted the importance of understanding PI aerosol”-”’. 
There are several implications. First, it will prove difficult to constrain 
the sources of forcing uncertainty by making observations in the PD 
atmosphere’’”', because the low sensitivity of PD clouds to these 
emissions'*~* is unrepresentative of the PI atmosphere. We would 
need to understand the effects of natural emissions on PI-like aerosol. 
The nearest equivalent we have to PI conditions in the PD atmosphere 
is in very clean environments’, but the spread of perturbed particle 
concentrations'®*° may make it difficult to observe sufficiently pristine 
environments, except over very remote marine locations”®. Second, 
because the magnitude of the PI emissions themselves (notably volcanic 
and DMS) are now unmeasurable, some of the forcing uncertainty in 
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Figure 3 | Schematic explaining the importance of natural emissions for 
forcing uncertainty. a, CCN concentrations are more sensitive to emissions of 
sulphur precursor gases in the PI era because the condensation sink of the 
nucleating sulphuric acid vapour onto existing aerosol is lower. b, Cloud 
droplet concentrations are more sensitive to changes in CCN when droplet 
concentrations are low because higher droplet concentrations suppress 
in-cloud supersaturation and limit the activation of additional aerosol particles. 
c, Cloud albedo is more susceptible to changes in cloud droplet concentration 
when concentrations are low’. d, All three effects in a—c lead to a much 
higher sensitivity of albedo to precursor gas emissions in the PI era. 

e, Calculated effect of how the uncertainty in modelled aerosol affects the 
uncertainty in forcing. In this example, it is assumed that the CCN 
concentration scales in direct proportion with anthropogenic emissions 
(horizontal axis), as occurs approximately in the model. Uncertainties are then 
applied to CCN (+ACCN). The green lines shows the uncertainty in forcing 
when ACCN is proportional to the CCN concentration and the blue lines shows 
a case where ACCN is constant and independent of the anthropogenic 
emissions. The initial PI CCN concentration is 50 cm °, rising to a maximum 
of 750cm ° in the PD. The cloud droplet number concentration (CDNC) is 


climate simulations may be irreducible’. Therefore, empirical esti- 
mates of PI-to-PD forcing based on observations of aerosol and aero- 
sol-cloud relations under PD conditions’'’* may not be accurate. 
Third, efforts to constrain the magnitude of equilibrium climate sensi- 
tivity based on net forcing and ocean heat content relative to a PI 
reference” will always be hampered by our limited ability to constrain 
the natural aerosol state. Although it makes sense to define the forcing 
relative to a PI reference period when the forcing was zero and the 
Earth was approximately in energy balance, this does not imply zero 
error contribution from the reference state. We have shown that the 
uncertainties in PI-to-PD forcing are strongly affected by the PI aerosol. 
Other reference periods could be considered, but equilibrium climate 
sensitivity studies would need to account for uncertainties in ocean 
heat content. A final implication is that the major sources of uncer- 
tainty will depend on the period over which forcing is calculated, so 
future aerosol indirect forcing may be sensitive to a different set of 
parameters. Therefore, accurate simulation of past forcing, if this could 
be achieved, may not guarantee accurate future estimates. Furthermore, 
other aerosol-cloud interaction effects not considered here** may also 
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calculated as CDNC = 375 X (1 — exp(—0.0025 X CCN)) (ref. 28). The albedo 
A of the baseline cloud is assumed to be 0.5 and the albedo versus CDNC is 
dA/dIn(CDNC) = A(1 — A)/3A (ref 1). The forcing is calculated according to 
—Fy T2 AA, where AA is the change in albedo from the PI value (0.5), 

T, is the transmission of the atmosphere (assumed to be 0.75) and Fo is the 
radiative flux, assumed to be 340 Wm ”. The black line shows the calculated 
forcing assuming the baseline aerosol number concentration. The green line 
shows the calculated forcing assuming +30% uncertainty in CCN (35-65 cm ~ 
in the PI erato 525-975 cm * maximum in the PD). This calculation represents 
an uncertainty in aerosol concentrations due to a process that affects PI and 
polluted aerosol concentrations by the same factor, such as dry deposition. 
The blue line shows the calculated forcing assuming 15cm’ ° uncertainty in 
CCN (35-65 cm ° in the PI era, as in the scaled calculation, to a maximum 
polluted concentration of 735-765 cm’ *). This calculation represents an 
uncertainty in aerosol concentrations due to a process or emission that affects 
PI and polluted aerosol by approximately the same absolute amount, 

such as caused by uncertainty in DMS or volcanic SO, emissions. The small 
absolute change in aerosol has a much larger effect on forcing uncertainty than 
the scaled aerosol change. 


3 


depend nonlinearly on aerosol between the PI period and the PD, and 
the uncertainties could be dominated by a different set of parameters. 

Future efforts to reduce the uncertainty in simulated aerosol forc- 
ing need to combine measurements and models in ways that target 
sources of uncertainty, rather than relying on good model-observation 
agreement of PD aerosol as a measure of model fidelity. This means 
that models need to be based on sound microphysical processes and 
must not be reliant on tuning to PD aerosol levels. It is also essential for 
climate models and other models used for long-term simulations to 
include accurate representations of all natural aerosols, even if they 
have a small effect on PD aerosol levels in polluted environments. Even 
as future simulations of aerosol—cloud interaction become increasingly 
spatially resolved and able to capture more aerosol-cloud interaction 
processes and associated uncertainties, the uncertainty introduced 
from a poorly constrained pristine aerosol state will remain. 


METHODS SUMMARY 


The GLObal Model of Aerosol Processes (GLOMAP)!*"'® calculates the time- 
dependent global distribution of size-resolved aerosol particles, including the 
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microphysical processes of particle nucleation, growth, coagulation, cloud cycling 
and deposition. The model transport is prescribed in terms of three-dimensional 
gridded wind speed, temperature and humidity fields from the European Centre 
for Medium-Range Weather Forecasts analyses. Anthropogenic emissions (from 
fossil fuel, biofuel and biomass burning) were set to their 1750 (plus alternative 1850 
and 1900) values in the PI era and year 2000 values for the PD and were perturbed 
bya given factor (see Extended Data Tables 1 and 2). The uncertainty range for each 
parameter was chosen on the basis of expert elicitation'*. The model was run for 
168 combinations of parameter settings from among 28 parameters representing 
aerosol and precursor gas emissions, microphysical processes and aerosol model 
structures (see Extended Data Table 3). 

The radiative forcing between the PI era and the PD was calculated using a radia- 
tive transfer model” and a monthly and geographically varying data set of cloud 
optical depth*®. The diurnal cycle of incoming solar radiation was accounted for, 
but no diurnal cycle of cloud cover was assumed. Cloud droplet concentrations 
were calculated from the aerosol size distribution assuming a characteristic updraft 
speed of 0.15ms° ' over ocean and 0.3ms_' over land. The albedo change in each 
grid cell was then calculated in terms of the change in cloud droplet effective radius 
at constant liquid water path*’. Global mean forcings were calculated for 1750- 
2000, 1850-2000, 1900-2000 and 1850-1980 (see Extended Data Table 4). 

A Gaussian process emulator was built to describe the monthly and global mean 
PI-to-PD change in top-of-the-atmosphere radiation across the space of the 28 
uncertain parameters'®, and then a Monte Carlo sampling of the emulator was used 
to generate a probability distribution of global mean forcing. Variance decomposi- 
tion was used to quantify the fraction of variance attributable to the uncertain 
parameters. The forcing and uncertainty maps (Fig. 1) were generated by perform- 
ing a similar analysis on separate emulators for each model grid cell. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Model description. The GLObal Model of Aerosol Processes (GLOMAP-mode)'*!° 
is a three-dimensional global aerosol microphysics model that simulates the evolution 
of the particle size distribution and size-resolved chemical composition of aerosol 
particles on a global three-dimensional grid. The model has previously been evalu- 
ated against observations’ and improved by comparing aerosol simulations against 
a more detailed version of the model that treats the aerosol size distribution using a 
sectional approach*’, The GLOMAP models have been widely used and evaluated 
against global measurements of particle number concentrations****, CCN***®, aerosol 
chemical components*”~”, and cloud droplets*®. The aerosol module is run within 
the TOMCAT global three-dimensional offline chemistry transport model". The 
aerosol and chemical species are transported by three-dimensional meteorological 
fields read in from the European Centre for Medium-Range Weather Forecasts 
(ECMWE) ERA- Interim reanalyses for 2008. Aerosol transport is advanced every 
30 min by interpolating between the analyses, which are updated every 6 h and the 
aerosol microphysical/chemical processes are calculated ona range of shorter time- 
steps of less than 30 min. Uncoupling the aerosol from the model transport and 
meteorology in the chemistry transport model (so that aerosol does not affect mete- 
orology) is equivalent to the commonly used “double-call” approach in a climate 
model” in which the aerosol radiative effects are decoupled from the model physics 
so that particular radiative forcings can be diagnosed using pairs of model runs. The 
model was run ata horizontal resolution of 2.8° X 2.8° with 31 vertical levels between 
the surface and 10 hPa. 

The aerosol size distribution is defined by seven log-normal modes: one nuclea- 
tion mode and soluble and insoluble modes covering the Aitken, accumulation and 
coarse size ranges. The aerosol chemical components are sulphate, sea salt, black 
carbon, particulate organic matter and dust. Secondary organic aerosol is produced 
from the first stage oxidation products of biogenic monoterpenes and anthro- 
pogenic volatile-organic-carbon compounds, and is assumed to have zero vapour 
pressure. It is combined with the particulate-organic-matter component after 
kinetic condensation on the aerosol. The model includes dust emissions, but we 
do not perturb them because we focus on the effect on CCN concentrations, which 
we have previously shown are not strongly affected by dust particles even in intense 
dust storms”. 

The microphysical model resolves the main processes that shape the particle 
size distribution on a global scale: new particle formation, coagulation, gas-to- 
particle transfer, cloud processing, and dry and wet deposition. Wet deposition of 
particles occurs by two processes. In-cloud nucleation scavenging in which acti- 
vated particles form cloud droplets and are removed in precipitation and below- 
cloud impaction scavenging by falling raindrops. ECMWF meteorological fields 
are used to diagnose large-scale frontal precipitation and sub-grid convective preci- 
pitation is assumed to occur in 30% of the affected grid box area. Low-level strati- 
form clouds are read in separately from International Satellite Cloud Climatology 
Project (ISCCP) D2 data*®. In these clouds we assume that aerosol particles are 
activated and subsequently undergo ‘cloud processing’ in which sulphate mass is 
added to activated aerosol owing to the aqueous-phase oxidation of sulphur dioxide. 

Concentrations of the oxidants OH, O3, HO, and NO; and HO, were specified 
on the three-dimensional grid using six-hourly monthly mean concentrations 
from a TOMCAT simulation with detailed tropospheric chemistry“*. Concentra- 
tions of HO, are depleted through the aqueous-phase reaction with SO, and 
replenished through the reaction HO, + HO; (ref. 15). A spin-up period of six 
months was performed (three months of which had parameters set at their median 
value and formed the basis of a further three-month spin for each of the runs with 
the changed parameter settings). 

Emissions. The emission fluxes were perturbed by scaling baseline values, which 
are specified in Extended Data Table 1 for the 1750-2000 simulations, and in Extended 
Data Table 2 for the 1850-2000, 1900-2000 and 1850-1980 simulations. The Aerosol 
Comparisons between Observations and Models (AeroCom) emissions scenarios 
used for 1750 and 2000 are not available for the intermediate years. For the three 
additional time periods we therefore used the emissions prepared for the Atmos- 
pheric Chemistry and Climate Model Intercomparison Project (ACCMIP**) for 
1850, 1900, 1980 and 2000. For the sensitivity runs, we show results for June after 
two months of spin-up after the perturbations were applied. 

Cloud droplet number concentrations. CDNCs were calculated as a post-processing 
step using an activation parameterization*® and the modelled monthly mean aerosol 
size distribution and composition in each grid cell for each perturbed parameter 
run. These calculations account for the coupling between the uncertain aerosol 
particle size distribution (and composition) and the number of particles activated 
into cloud droplets. 

An updraught speed of 0.15 ms_' was used over marine regions and 0.3 m s~ 
over land, which is typical of cloud-base speeds in low-level stratus and stratocu- 
mulus clouds. Because updraft is highly variable in clouds it is normal to report 
updraft measurements as the standard deviation o ofa probability density function 


1 


of updrafts (normally centred on zero). However, it is possible to calculate CDNC 
using a single characteristic updraft speed (w*) that gives comparable results to 
using a probability density function of updrafts’’. The characteristic speed is given 
by w* = Bo, where B is a conversion factor constrained through closure studies to 
be in the range 0.65-0.8 (refs 47, 48). Here, we use w* = 0.15m s | over marine 
regions, which equates to ¢ = 0.19-0.23 ms _ 1 and w* =0.3ms ! overland, which 
equates to ¢ = 0.38-0.46 ms _'. The updrafts used to calculate CDNC should be 
representative of cloud base, where activation primarily occurs. Updrafts at cloud 
base are typically smaller than in-cloud updrafts as the latent heat released by conden- 
sation onto cloud droplets fuels higher in-cloud updrafts***°. For example, in measure- 
ments during the Marine Stratus/Stratocumulus Experiment (MASE) experiment”! 
the standard deviation rises from 0.3ms_' at cloud base to 0.6 ms at cloud top. 
Thus, we consider our updraft velocities to be appropriate for cloud base. 

To test the effect of higher updraft speeds, we recalculated CDNC for July using 
w* =0.25ms ' over oceans and 0.4ms | over land. We then built a new emu- 
lator for global annual mean forcing. The global mean forcing changes negligibly 
at the higher speeds. The fractional contributions to variance change from low to 
high updraft speed as follows: natural aerosol changes from 45% to 42%, anthro- 
pogenic emissions decrease from 34% to 33% and processes rise from 19% to 21%. 
Radiative forcing. The forcing was calculated as the difference of top-of-the- 
atmosphere net short-wave plus long-wave radiative fluxes between the PD and the 
PI periods. The PD and PI runs were performed using identical meteorological 
analyses. Each parameter perturbation run in the PI period was paired with its equi- 
valent setting in the PD period. The PI and PD simulations are therefore identical in 
every respect except for the anthropogenic emissions. The modelled aerosol proper- 
ties were then used to calculate the CDNC values in the PI and PD periods, from 
which the forcing was calculated for each two-dimensional grid point of the model. 

We used the off-line version of the Edwards and Slingo radiative transfer model” 
with six bands in the short-wave and nine bands in the long-wave, with a delta- 
Eddington two-stream scattering solver at all wavelengths. We used a monthly 
mean climatology for water vapour, temperature and ozone based on ECMWF 
reanalysis data, together with surface albedo and cloud optical depth fields from 
the International Satellite Cloud Climatology Project (ISCCP-D2)”° for the year 
2000. The diurnal cycle of incoming solar radiation was accounted for, but no 
diurnal cycle of cloud cover was assumed. The sensitivity of our forcing estimates 
to the cloud climatology is very small*', according to an extra set of calculations 
performed using the 1983-2008 multi-annual ISCCP cloud climatology. 

The cloud albedo forcing between the PI and PD experiments is quantified by 
modifying the cloud droplet effective radius r, for low- and mid-level water clouds 
up to 600 hPa: 


er_,pp ,, (CDNCPP) "”? 
CDNC™ 


where CDNC is the monthly mean cloud droplet number concentration in each 
grid cell. A fixed value for r? = 10 kum is used in order to ensure consistency with 
the ISCCP cloud retrievals. This overestimates the strength of the forcing because 
clouds with more droplets tend, on average, to have less water for reasons that are 
physically understood”. However, our conclusions about the relative sources of 
uncertainty are not affected. 

Perturbed parameters. The ensemble of model runs was designed to enable an 
emulator to be built. The ensemble consists of 168 combinations of parameter 
settings from 28 parameters representing aerosol and precursor gas emissions, 
microphysical processes and aerosol model structures. The uncertainty range for 
each parameter was chosen based on expert elicitation’* (see Extended Data Table 
3 for a complete list). Parameter combinations within the uncertainty range were 
defined by a maximin Latin Hypercube sampling of the parameter space. Paired 
simulations were run for one year for 1750 and 2000 using the AeroCom emis- 
sions (336 runs in total) and for one month for 1850, 1900, 1980 and 2000 using 
the ACCMIP emissions. Each parameter setting in the PI era was paired with the 
same parameter setting in the PD. Anthropogenic emissions (from fossil fuel, 
biofuel and biomass burning) were set to their 1750, 1850 or 1900 values in the PI 
era and the year 1980 and 2000 values for the PD and were perturbed by the same 
factor. 

Parameters 1 to 14 define aerosol microphysical processes and the definition of 
the size distribution modal parameters, whereas parameters 15 to 28 define the 
emission of aerosols and precursor gases. Full details about the parameter ranges 
and the process of expert elicitation are provided in ref. 18. Here we summarize 
the main aspects of the perturbed parameters. 

The boundary-layer nucleation parameterization (P1) assumes a rate j = A[HSO4(g)], 
with A (in units of s~') being the perturbed parameter. Throughout the atmos- 
phere (but important only in the free troposphere) we use a binary homogeneous 
H2SO,-H20 nucleation rate model scaled by an uncertain factor (P2). 
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Ageing is the process by which freshly emitted carbonaceous particles (for 
example, from biomass burning) can become increasingly water soluble as they 
accumulate water-soluble compounds during transport. The controlling para- 
meter, as used in many global models”, is the number of monolayers of soluble 
material required to convert the particles into water-soluble particles, which can 
then act as CCN. The lower limit of our assumed ageing means that initially 
insoluble particles become soluble on the timescale of hours in polluted regions. 
This process affects the number of aerosol particles able to form cloud droplets in 
the forcing calculation, and also affects the cloud processing of aerosol during run 
time of the model. 

The activation diameter (P4) defines the dry-equivalent diameter at which par- 
ticles are able to activate to cloud droplets during run time of the model. A single 
value of activation diameter is used globally in a given run. The activation dia- 
meter controls the formation of cloud droplets in all low-level clouds, and thereby 
controls which particles undergo cloud processing (sulphate production on the 
particles due to oxidation of sulphur dioxide) and removal in precipitation, and 
therefore shapes the particle size distribution. Cloud droplet concentrations from 
run-time of the model are not used in the forcing calculations (they are only used 
as a physical process that shapes the particle size distribution). For the forcing (see 
previous section) cloud droplet concentrations are quantified as a post-processing 
step based on the uncertain particle size distributions and an assumed updraft speed. 
In reality, the activation diameter in a given cloud updraft is controlled by the particle 
size distribution, solubility and updraft speed. The updraft speed was not included as 
a perturbed parameter because the chemical transport model does not have a para- 
meterization of updrafts in different cloud types and environments. Perturbation 
of the activation diameter accounts approximately for the uncertainty in updraft 
speed, but by prescribing a value in each simulation it ignores the way that the 
diameter is coupled to the properties of the aerosol size distribution in a given grid 
cell. This approach is likely to overestimate the uncertainty because it allows a larger 
variation in the number of aerosol particles that can be cloud processed, which 
would otherwise be damped by the coupling of particle number and activation 
diameter. Nevertheless, activation diameter makes a small contribution to global 
forcing uncertainty (Fig. 2c). 

The parameters SO203_CLEAN and SO203_POLL (P5 and P6) control the 
cloud droplet pH, which affects the production rate of aerosol sulphate from oxi- 
dation of sulphur dioxide by ozone. The pH is the perturbed parameter and can 
cause a change in rate by a factor of 10° for pH between 3 and 6. One parameter is 
used for clean (lower acidity) environments (sulphur dioxide less than 0.5 parts 
per billion) and one for polluted environments. 

The in-cloud scavenging diameter offset (P7) controls the diameter of aerosols 
that can be removed by in-cloud nucleation scavenging. This allows some parti- 
cles to be activated but not scavenged, assuming that the largest droplets initiate 
precipitation. The lower limit of P7 (zero nanometres) assumes all activated parti- 
cles are subject to removal during precipitation. 

The scavenging efficiency in ice-containing clouds (P8) controls the fraction of 
particles accessible to nucleation scavenging when air is below — 10 °C. Our previous 
work has shown this parameter to be important in controlling aerosol transport to 
the Arctic’*. Dry deposition of Aitken and accumulation mode particles (P9 and 
P10) is scaled for each particle size by a given factor. GLOMAP calculates the wind 
speed and size-dependent deposition velocity due to Brownian diffusion, impac- 
tion and interception. The accumulation and Aitken mode width parameters (P11 
and P12) define prescribed, globally constant geometric standard deviations of the 
log-normal size distribution modes. The mode separation diameters (P13 and P14) 
define the ranges over which the geometric mean radius can vary while staying ina 
particular mode. 

Fossil fuel, biofuel and biomass burning particle emission flux parameters (P15, 
P16 and P17) scale the mass emission fluxes in the PI based on the 1750, 1850 or 
1900 emissions and in the PD based on the year 2000 emissions. The spatial 
distribution of emissions is different in the PI and PD. Our perturbation accounts 
for uncertainty in the monthly mean flux but does not account for uncertainty in 
the spatial pattern or temporal variability in the emissions. Thus, for wildfires we 
are not able to separate the effects of more intense fires versus more frequent fires, 
which might have different effects on forcing uncertainty. The biomass burning 
from open fires was not separated into natural and anthropogenic emissions, so we 
cannot apportion the uncertainty. Examination of the variation of uncertainty due 
to these parameters shows a clear seasonal cycle, with a peak contribution to global 
mean variance of 5% in July and August, which can be attributed to northern mid- 
latitude wildfires, versus generally less than 1% at other times. The tropical fires are 
important for CCN"* but the uncertainty in forcing is limited by the low sensitivity 
of cloud albedo at very high CCN concentrations. Because most of the northern 
mid-latitude emissions can be associated with natural fires, we have associated 
the biomass burning uncertainties with the natural emissions. 
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Fossil fuel, biofuel and biomass burning particle emission sizes (P18, P19 and 
P20) directly control the number of emitted particles for a given mass flux, and 
therefore directly influence the CCN population. The sub-grid scale sulphate 
particle production parameters (P21 and P22) define the formation of particles 
in sub-grid scale power plant plumes”*”*. P21 defines the fraction of the emitted 
sulphur dioxide mass that enters the model grid square as new sulphate particles 
and P22 defines the dry size of these particles (and hence their number concen- 
tration for fixed mass) when emitted into the global grid box. 

The sea spray particle mass flux (P23) is scaled by a factor. GLOMAP-mode 
simulates sea spray particles between 35 nm and 20 tm dry diameter. This para- 
meter conflates multiple sources of uncertainty associated with the wind-speed 
dependence of the flux such as processes not unaccounted for in the parameteri- 
zations (for example, fetch), the wind speed, and the limited spatial resolution of 
the wind fields in the model. 

Anthropogenic sulphur dioxide emissions (P24) are scaled by a factor based on 
the emissions in 1750, 1850, 1900 and 2000, as described above. Continuously degass- 
ing volcanic sulphur dioxide emissions (P25) are scaled based ona global inventory 
widely used in global models*’. Time-averaged sporadic emissions are also included. 
Volcanic emissions are assumed to be the same in the PI and PD simulations. 

DMS emissions (P26) are controlled by the sea-water concentration of DMS** 
and the wind-driven transfer velocity parameterization”. We conflate these 
uncertainties by perturbing the calculated sea-air transfer flux by a given factor. 
This leads to identical absolute perturbations to the DMS flux in the PI era and the 
PD. We do not account for uncertainty in the spatial pattern of DMS seawater 
concentration, which is likely to be an important factor in the overall uncertainty”. 

The biogenic secondary-organic-aerosol production parameter (P27) conflates 
the uncertainty in the emissions of the precursor gases (biogenic volatile organic 
carbons) and the uncertainty in the yield of secondary-organic-aerosol material 
following oxidation reactions into a single parameter. P27 scales the volatile organic 
carbon emissions (with fixed chemical yield) such that global annual secondary- 
organic-aerosol production lies between the values given in the table. There are also 
uncertainties in the volatility of different compounds that we do not account for 
here. The range of emissions used here has been shown to span the range of global 
in situ measurements of organic aerosol”’. 

Anthropogenic secondary organic aerosol production (P28) is treated in a 
similar way to biogenic secondary organic aerosol, by conflating the uncertainty 
in emissions and yield into a single emission uncertainty. We used the same approach 
as in ref. 37 by scaling gridded carbon monoxide emissions over a range known to 
span the range of observed organic aerosol in the PD atmosphere. The range is then 
scaled further to account for the changes in carbon monoxide emissions in 1850, 
1900 and 1980. 

Model emulation. Gaussian process emulation'*”° was used to estimate model 
predictions at untried points throughout the space of the uncertain model para- 
meters. An emulator was built for the monthly mean first indirect radiative 
forcing for every two-dimensional grid point (to produce Fig. 1 in the main text) 
and for the global annual mean and monthly mean forcings to generate Fig. 2 in 
the main text. The emulator was validated in each case using 84 additional model 
runs (Extended Data Fig. 1) to ensure that the emulator uncertainty around its 
mean is low compared to the parametric uncertainty. Twenty-eight of the valida- 
tion runs were designed to lie near the training points and 64 were defined using a 
separate Latin Hypercube design™. The coefficient of determination (7°) of the 
global annual mean emulator forcing versus simulated forcing is 0.94. 
Variance-based sensitivity analysis. Variance-based sensitivity analysis is used 
to decompose the uncertainty in the model predictions to the uncertainty in the 
model parameters. The total variance of the forcing was calculated by sampling 
from the emulator mean function using the extended-FAST method”. We sampled 
5,000 points per parameter (140,000 in total) from the emulator to obtain a proba- 
bility distribution of forcing. Two measures of sensitivity were calculated: the main 
effect index measures by how much the variance will be reduced if the parameter 
can be learnt precisely, and the total effect index measures both the individual effect 
and the interaction effect of each parameter with all others. The two sensitivity 
measures are compared to assess the sensitivity of the model output to interactions. 
Figure 2b (see main text) shows that parameter interactions account for generally less 
than 10% of the monthly global mean forcing variance (shown as the residual white 
space above each coloured bar). We note that a nonlinear response of the model output 
to a parameter across the specified range is accounted for in the main effect variance. 
Uncertainty results using different reference periods. Extended Data Table 4 
presents results for the additional simulations in which alternative reference years 
were used for the calculation of forcing (1850 and 1900 instead of 1750), as well as 
for the period 1850-1980. 
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Extended Data Figure 1 | Validation of the global annual mean forcing 


emulator. The error bars show the emulator 95% uncertainty range around the 
mean prediction. The 1:1 line is shown. 


Extended Data Table 1 | Emissions of aerosols and precursor gases used in the 1750-2000 simulations 


Aerosol source Emitted species PD (2000) flux PI (1750) flux Ref. 
Fossil fuel BC / Tg C per year 3.0 0.0 17 
POM / Tg POM per year 3.2 0.0 17 
Power stations SO2/ Tg S per year 24.2 0.0 17, 62 
Industrial processes SO2 19.6 0.0 17, 62 
Transportation SO2 48 0.0 17, 62 
Off-road SO2 0.8 0.0 17, 62 
Biofuel BC / Tg C per year 1.6 0.4 17, 63 
POM / Tg POM per year 9.1 1.6 17, 63 
Domestic SOz2 / Tg S per year 4.8 0.12 17, 63 
Wildfires BC / Tg C per year 3.1 1.03 17, 64 
POM / Tg POM per year 34.7 12.8 17, 64 
SO2/ Tg S per year 2.1 1.46 17, 64 
Volcanoes SO2/Tg S per year 12.6 12.6 57 
Marine dimethyl sulphide _ DMS*/ Tg S per year 17.1 17.1 58 
Sea spray Salt Wind-dependent Wind-dependent 65 
flux flux 
Biogenic volatile organic | Monoterpenes 5-360 5-360 37, 66 (for 
carbon (a-pinene) / Tg POM spatial/temporal 
produced per year variation) 
Anthropogenic volatile VOC /Tg POM produced 2-112 0.0 37 
organic carbon per year 


*The DMS emission flux is a global annual value but emissions are calculated at each time step based on the seawater DMS concentration field®* and a sea-air transfer velocity®’. These baseline emissions were 
perturbed according to the factors given in Extended Data Table 3, except for the volatile organic carbon emissions, which were perturbed over the absolute range indicated in this table and in Extended Data Table 3. 
BC, black carbon; POM, particulate organic matter; VOC, volatile organic carbon. 
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Extended Data Table 2 | Emissions of aerosols and precursor gases used in the 1850-2000, 1900-2000 and 1850-1980 simulations. 


Aerosol source Emitted species 2000 flux 1980 flux 1900 flux 1850 flux Ref. 
Fossil fuel BC / Tg C per year 3.0 24 0.4 0.1 45 
POM / Tg POM per 58 35 08 02 45 
year 
Power stations SO2/ Tg S per year 24.8 30.7 1.8 0.0 45 
nalustnal SO2 8.8 11.3 5.6 0.6 45 
processes 
Transportation SQOo2 7.7 6.9 0.9 0.0 45 
Off-road SO2 5.6 3.6 0.9 0.0 45 
Biofuel BC / Tg C per year 2.2 25 2.0 1.0 45 
eee! 12.4 12.0 8.9 6.4 45 
year 
Domestic SO2/ Tg S per year 4.3 6.2 2.6 0.4 45 
Wildfires BC / Tg C per year 1.03 1.03 1.03 1.03 17, 64 
BONG POM per 12.8 12.8 12.8 12.8 17, 64 
year 
SO2/ Tg S per year 1.46 1.46 1.46 1.46 17, 64 
Volcanoes SO2/ Tg S per year 12.6 12.6 12.6 12.6 57 
Marine dimethyl 
sulphide DMS / Tg S per year 1A 1A 17.1 17.1 58 
Wind- Wind- Wind- Wind- 
Sea spray Salt dependent dependent dependent dependent 65 
flux flux flux flux 
Biogenic volatile Monoteipenes 
g (a-pinene) / Tg POM 5-360 5-360 5-360 5-360 37, 66 


organic carbon 


oduced per year 


Ee  —— 


Anthropogenic 37,45 

volatile organic voer Io rey 2.7-145.0 2.6-150.0 0.5-26.5 0.3-15.0 (CO 
produced per year os 

carbon emissions) 
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Extended Data Table 3 | Parameters and their maximum ranges used in the model simulations. 


Parameter Parameter name Description Uncertainty range Effect 

number 

Process parameters 

P1 BL_NUC Boundary layer nucleation rate 3.2x10'-2x10%s' Absolute 

P2 FT_NUC Free troposphere nucleation rate 0.01 - 10 Scaled 

P3 AGEING Ageing ‘rate’ from insoluble to soluble 0.3 - 5 monolayer Absolute 

P4 ACT_DIAM Cloud drop activation dry diameter 50 - 100 nm Absolute 

PS $0203_CLEAN pH of cloud drops (controls SO2 + O3) pH4-6.5 Absolute 

P6 $0203 _POLL pH of cloud drops (SO2 + Os) pH 3.5-5 Absolute 

P7 NUC_SCAV_DIAM Nucleation scavenging offset dry 0-50nm Absolute 
diameter 

P8 NUC_SCAV_ICE Nucleation scavenging fraction 0-1 Scaled 
(accumulation mode) in mixed and ice 
clouds (T< —-15 °C) 

P9 DRYDEP_AIT Dry deposition velocity of Aitken mode 0.5-2 Scaled 
aerosol 

P10 DRYDEP_ACC Dry deposition velocity of accumulation 0.1- 10 Scaled 
mode aerosol 

P11 ACC_WIDTH Mode width (accumulation 1.2-1.8 Absolute 
soluble/insoluble mode) 

P12 AIT_WIDTH Mode width (Aitken soluble/insoluble) 1.2-1.8 Absolute 

P13 NUC-AIT_SEP Mode separation diameter 9-18nm Absolute 
(nucleation/Aitken) 

P14 AIT-ACC_SEP Mode separation diameter 0.9 -2xACT_DIAM Scaled 
(Aitken/accumulation) 

Aerosol and precursor gas emissions 

P15 FF_EMS BCOC mass emission rate (fossil fuel) 0.5-2 Scaled 

P16 BB_EMS BCOC mass emission rate (biomass 0.25 - 4 Scaled 
burning) 

P17 BF_EMS BCOC mass emission rate (biofuel) 0.25 - 4 Scaled 

P18 FF_DIAM BCOC emitted number median dry 30 - 80 nm Absolute 
diameter (fossil fuel) 

P19 BB_DIAM BCOC emitted number median dry 50 - 200 nm Absolute 
diameter (biomass burning) 

P20 BF_DIAM BCOC emitted number median dry 50 - 200 nm Absolute 
diameter (biofuel) 

P21 PRIM_SO4_FRAC Mass fraction of SO2 converted to new 0- 1% Scaled 
sulphate particles in sub-grid plumes 

P22 PRIM_SO4_DIAM Emitted number median dry diameter of 20 - 100 nm Absolute 
sub-grid sulphate particles 

P23 SEASPRAY Sea spray mass flux 0.2-5 Scaled 
(coarse/accumulation) 

P24 ANTH_SO2 SO2 emission flux (anthropogenic) 0.6- 1.5 Scaled 

P25 VOLC_SO2 SO2 emission flux (volcanic) 0.5-2 Scaled 

P26 DMS_FLUX DMS emission flux 0.5-2 Scaled 

P27 BIO_SOA Biogenic monoterpene production of SOA = 5 - 360 Tg POM Absolute 

per year * 
P28 ANTH_SOA Anthropogenic VOC production of SOA 2-112 Tg POM Absolute 
per year * 


The Latin Hypercube sampling of parameter combinations was designed to lie within these ranges. In the ‘effect’ column, for the scaled parameters the magnitude of the parameter was multiplied by a factor lying 
between the maximum and minimum given in the ‘uncertainty range’ column and for absolute adjustments, the parameter was set to a value within the range of absolute values in the ‘uncertainty range’ column. 
BCOC, black carbon/organic carbon; SOA, secondary organic aerosol. *The values given refer to the mass of POM produced, although the perturbations were actually applied to the emitted VOCs. The POM range 
refers to year 2000. For other years see Extended Data Table 2. The baseline emissions are given in Extended Data Tables 1 and 2. 
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Extended Data Table 4 | Results for the different periods. 


Period Emissions Forcing Forcing Forcing Percentage Percentage Percentage 
(W m*) standard — standard of variance of variance of variance 
deviation deviation due to due to due to 
(W m”*) divided by __ natural anthropogenic aerosol 
forcin emissions emissions rocesses 
Annual mean values 
1750-2000 AeroCom -1.16 0.22 0.18 45 34 14 
June mean values 
1750-2000 AeroCom -1.42 0.27 0.19 45 38 13 
1850-2000 ACCMIP -1.30 0.28 0.22 37 46 11 
1900-2000 ACCMIP -0.96 0.20 0.21 37 43 13 
1850-1980 ACCMIP -1.29 0.27 0.21 38 47 11 


All numbers refer to global means. The last three columns refer to percentage of variance. 
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SHANK3 overexpression causes 
manic-like behaviour with unique 
pharmacogenetic properties 


Kihoon Han'’, J. Lloyd Holder, Jr**°, Christian P. Schaaf?, Hui Lu’??, Hongmei Chen?*°, Hyojin Kang}, Jianrong Tang*, 
Zhenyu Wu**, Shuang Hao**, Sau Wai Cheung"’, Peng Yu’, Hao Sun**°, Amy M. Breman"’, Ankita Patel!’, Hui-Chen Lu? 
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Mutations in SHANK3 and large duplications of the region spanning SHANK3 both cause a spectrum of neuropsychiatric 
disorders, indicating that proper SHANK3 dosage is critical for normal brain function. However, SHANK3 overexpression 
per se has not been established as a cause of human disorders because 22q13 duplications involve several genes. Here we 
report that Shank3 transgenic mice modelling a human SHANK3 duplication exhibit manic-like behaviour and seizures 
consistent with synaptic excitatory/inhibitory imbalance. We also identified two patients with hyperkinetic disorders 
carrying the smallest SHANK3-spanning duplications reported so far. These findings indicate that SHANK3 overexpression 
causes a hyperkinetic neuropsychiatric disorder. To probe the mechanism underlying the phenotype, we generated a 
Shank3 in vivo interactome and found that Shank3 directly interacts with the Arp2/3 complex to increase F-actin levels 
in Shank3 transgenic mice. The mood-stabilizing drug valproate, but not lithium, rescues the manic-like behaviour of 
Shank3 transgenic mice raising the possibility that this hyperkinetic disorder has a unique pharmacogenetic profile. 


An increasing number of neuropsychiatric disorders such as autism spec- 
trum disorder (ASD), intellectual disability, schizophrenia, obsessive- 
compulsive disorder and bipolar disorder are being classified as ‘synap- 
topathies'~*, because the genes mutated in these disorders lead to 
abnormal synaptic development or function. For example, all members 
of the SHANK gene family (SHANK1, SHANK2 and SHANK3), which 
encode for core scaffolding proteins organizing macromolecular com- 
plexes at the postsynaptic density (PSD)°, have been linked to human 
synaptopathies® *. Point mutations in SHANK3 (also called ProSAP2) 
are seen in non-syndromic autism, intellectual disability and schizo- 
phrenia*’, and deletions of the region containing SHANK3 cause 
Phelan-McDermid syndrome (22q13 deletion syndrome)'**. Further- 
more, mice modelling SHANK3 deletion mirror the human beha- 
vioural phenotypes and display synaptic abnormalities”. 

Interestingly, a few 22q13 duplications spanning SHANK3 have 
been reported in patients diagnosed with Asperger syndrome, atten- 
tion deficit hyperactivity disorder (ADHD) or schizophrenia”'®”’, 
indicating that increased expression of SHANK3 could be also deleteri- 
ous. Because duplication cases involve large genomic regions (>0.8 Mb) 
with more than 20 genes, however, it is unclear if it is SHANK3 over- 
expression that causes a neurological phenotype. Here we describe the 
generation and characterization of Shank3-overexpressing mice and 
studies on the role of SHANK3 duplication in hyperkinetic neuropsy- 
chiatric disorders in humans. We identify a mechanism that contri- 
butes to the neuronal phenotype and report pharmacological therapies 
that might benefit individuals with SHANK3 overexpression. 


Manic-like behaviour in Shank3 transgenic mice 


We introduced an amino-terminal enhanced green fluorescent protein 
(EGFP) tag into mouse Shank3 to generate EGFP-Shank3 transgenic 


mice (Fig. la and Extended Data Fig. la). Similar to endogenous 
Shank3 (ref. 16), EGFP-Shank3 transcript was detected in cortex, hippo- 
campus and striatum (Fig. 1b). The regional and developmental expres- 
sion of EGFP-Shank3 recapitulated those of endogenous Shank3 (Extended 
Data Fig. 1b-d). EGFP-Shank3 was targeted to the dendritic spines and 
co-localized with excitatory postsynaptic marker PSD-95, but not with 
inhibitory postsynaptic marker gephyrin (Fig. 1c). Compared to wild- 
type mice, transgenic mice expressed 1.2-fold to twofold higher levels 
of each of the three major Shank3 isoforms («, B and y), with ~50% 
more total Shank3 protein (Fig. 1d and Extended Data Fig. le, f), which 
parallels expression in human SHANK3 duplication patients. There 
was no significant change in levels of other synaptic proteins in the hippo- 
campus and striatum of transgenic mice (Extended Data Fig. 1g, h). 

Our Shank3 transgenic mice showed increased locomotor activity 
and speed compared with wild-type littermates in the open field test 
(Fig. 2a, b and Extended Data Fig. 2a). Transgenic mice were also 
hyperactive in their home cages, indicating that their behaviour in the 
open field was not due to the novel environment (Extended Data Fig. 2b). 
Given that one patient carrying a large 22q13 duplication spanning 
SHANK3 has been reported to have ADHD"°, we sought to determine 
whether the hyperactivity in the transgenic mice is related to ADHD by 
testing whether it could be corrected by amphetamine, a treatment for 
ADHD. Surprisingly, rather than the expected paradoxical calming effect, 
an acute injection of amphetamine aggravated the hyperactivity of 
transgenic mice to a greater degree than wild-type mice (Fig. 2c, d). As 
increased sensitivity to amphetamine is one characteristic of mania°— 
bipolar disorder can be misdiagnosed as ADHD, especially in children 
due to the overlap in diagnostic criteria*—we pursued behavioural 
tests to determine if there are abnormalities typically associated with 
a manic state. 
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Figure 1 | Characterization of EGFP-Shank3 expression in Shank3 
transgenic mice. a, Diagrams show the modified Shank3 BAC and EGFP- 
Shank3. Kozak (ACCATGG) and EGFP-coding sequence were integrated into 
the start codon of Shank3. b, RNA in situ hybridization with a probe against 
EGFP detected EGFP-Shank3 in the brain. Cb, cerebellum; Ct, cortex; Hp, 
hippocampus; St, striatum; TG, transgenic; Th, thalamus; WT, wild type. 

c, EGFP-Shank3 localizes to excitatory postsynaptic sites in cultured 
hippocampal neurons. MAP2 is a dendritic marker. Scale bar, 10 pm. 

d, Quantification of the fold changes of Shank3 («, B and y isoforms) in 
synaptosomal fraction of whole brain (n = 4, biological replicates), 
hippocampus or striatum (n = 6) from 6-week-old mice. All data are presented 
as mean + s.e.m. *P < 0.05; unpaired two-tailed Student’s f-test. 


Immobility during the tail-suspension test has been used as an indi- 
cator of despair. Consistent with manic-like behaviour”', the duration 
of immobility in Shank3 transgenic mice was less than wild-type lit- 
termates (Fig. 2e); the transgenic mice also exhibited elevated acoustic 
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startle response with reduced prepulse inhibition (PPI), abnormal 
circadian rhythms, and hyperphagia-like behaviour (Fig. 2f-j and 
Extended Data Fig. 2c, d), all of which are behaviours observed in 
humans during manic episodes’. 

We normalized Shank3 levels by crossing the transgenic mice with 
Shank3B*'~ mice’ and found that this reversed hyperactivity, reduced 
immobility in tail-suspension and reduced PPI (Extended Data Fig. 3). 
Shank3 transgenic mice also displayed decreased social interaction 
(Extended Data Fig. 2e-g), which has also been seen in patients with 
large 22q13 duplications””*. Unlike Shank3 knockout mice’*"”’, Shank3 
transgenic mice did not exhibit repetitive behaviour (Extended Data 
Fig. 2h-j), but they made fewer calls during the ultrasonic vocalization 
test at postnatal day 13 (Extended Data Fig. 2k, 1). Together, these 
results suggest that ~50% increase in Shank3 level causes a hyperkinetic 
phenotype in mice that resembles mania, and that it is the increased 
dosage of SHANK3 in 22q13 duplications that probably contributes to 
hyperkinesia in humans. 


SHANK3 duplications in humans 


To confirm unequivocally the role of SHANK3 dosage in the neurop- 
sychiatric phenotypes, we queried the array Comparative Genomic 
Hybridization (aCGH) database at the Molecular Genetics Laboratory 
of Baylor College of Medicine and identified two cases that carry small 
duplications (Fig. 3a, b and Supplementary Data). 

The first individual is an 11-year-old girl diagnosed with ADHD, 
combined type. The 22q13 duplication includes the entire SHANK3 
and part of the ACR gene (Fig. 3a). ACR encodes acrosin, a protease 
expressed in sperm facilitating its penetration into oocytes, with no 
known expression or function in the brain. This individual has seizures 
and exhibits hyperactivity, poor attention, auditory overstimulation, 
hyperphagia and kleptomania. Notably, her hyperactivity is resistant to 
the paradoxical calming effect of stimulant medications seen in most 
cases of ADHD. The second individual is a 35-year-old man diagnosed 
with bipolar disorder and epilepsy who has a duplication encompassing 
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Figure 2 | Shank3 transgenic mice display manic-like behaviours. 

a, Transgenic mice show increased locomotor activity in the open field test. 
b, Transgenic mice did not habituate during the open field assay. 

c, d, Transgenic mice are hypersensitive to amphetamine. After 30 min of basal 
activity, amphetamine (2 mg per kg) or saline was administered (arrow) and 
locomotor activity was monitored for 60 min. e, Transgenic mice spend less 
time immobile in the tail-suspension test. f, Increased acoustic (120 dB) startle 
response of transgenic mice. g, Abnormal PPI of transgenic mice. 


h-j, Abnormal circadian rhythms of transgenic mice. h, Wheel running 
actograms of wild-type and transgenic female mice. After 8 days of light/dark 
(L/D) cycle, animals were released into constant darkness (arrow, D/D) for 

2 weeks. i, Among 20 transgenic mice tested, 3 displayed complete loss of 
rhythm during D/D. j, Transgenic mice showed increased period length and 
activity count, but normal alpha, compared to wild-type mice. All data are 
presented as mean + s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001. Statistical 
analyses for behavioural assays are in Supplementary Table 1. 
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Figure 3 | Individuals with SHANK3 duplications have hyperkinetic 
disorders. a, Array plot of an exon-targeted chromosome microarray analysis 
on DNA from an 11-year-old female with ADHD, seizures and aberrant 
behaviours. Black dots indicate probes with normal copy number and green 
dots indicate copy number gain. Solid and dotted lines define the minimum and 
maximum expected boundaries of the duplication, respectively. b, Array plot of 
the 35-year-old male with bipolar disorder and epilepsy. 


SHANK3, ACR and ARSA, the gene encoding arylsulfatase A (Fig. 3b). 
ARSA deficiency causes metachromatic leukodystrophy, and duplica- 
tions of ARSA have not been associated with any neurobehavioural 
phenotype. 

These two individuals carry the smallest 22q13 duplications span- 
ning SHANK3 reported so far, and their molecular and clinical findings 
support the notion that SHANK3 duplications result in hyperkinetic 
disorders and seizures in humans. More detailed clinical and molecular 
cytogenetic information about both individuals is provided in the sup- 
plementary material. 


Synaptic E/I imbalance in Shank3 transgenic mice 


Having found that humans with SHANK3 duplications have a hyper- 
kinetic disorder and seizures, we evaluated the Shank3 transgenic 
mice for seizures and synaptic abnormalities. We observed spontan- 
eous seizures in the Shank3 transgenic mice, but never in wild-type 
littermates (Supplementary Video 1). We monitored for electro- 
graphic seizures using electroencephalography (EEG). The EEG of 
transgenic mice showed hyperexcitability discharges accompanied 
by electrographic seizures (Fig. 4a). As alteration in the neuronal 
excitatory/inhibitory (E/I) balance is considered to cause seizures”, 
we examined morphological and functional changes of both excit- 
atory and inhibitory synapses. Cultured hippocampal pyramidal neu- 
rons from transgenic mice displayed increased VGLUT1-positive 
PSD-95 (VGLUT1 is also known as SLC17A7), an excitatory synaptic 
marker, and decreased VGAT-positive (also known as SLC32A1) 
gephyrin, an inhibitory synaptic marker, puncta density (Fig. 4b). 
We examined functional properties of synapses by whole-cell patch 
clamping in CA1 pyramidal neurons of acute hippocampal slices. 
Neuronal density, paired-pulse facilitation ratio, input-output relation- 
ship and intrinsic excitability of these neurons were similar in wild-type 
and transgenic mice, indicating normal basal synaptic transmission at 
Schaffer collateral-CA1 synapses of transgenic mice (Extended Data 
Fig. 4a-e). Consistent with decreased density of VGAT-positive gephyrin 
puncta in cultured hippocampal neurons from transgenic mice, we 
observed reduction in GABA, receptor-mediated miniature inhibitory 
postsynaptic current (mIPSC) frequency, but not in amplitude or decay 
time (Fig. 4c). AMPA receptor-mediated miniature EPSC (mEPSC) 
was normal, but amplitude of spontaneous EPSC (sEPSC, without 
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tetrodotoxin) was increased in transgenic mice (Extended Data Fig. 4f 
and Fig. 4d). There was no change in the AMPA/NMDA ratio in CA1 
pyramidal neurons of transgenic compared to wild-type mice (Fig. 4e). 
Consistently, NMDA receptor-dependent synaptic plasticity measured 
by extracellular field recordings at Schaffer collateral-CA1 pyramidal 
synapses was normal in trasngenic mice (Extended Data Fig. 4g, h). 
The spontaneous seizures of transgenic mice, together with morpho- 
logical and functional alterations in synapses, indicate that the synaptic 
E/I balance in transgenic neurons is shifted towards excitation. 


Arp2/3 complex mediates F-actin increase 


To understand the molecular mechanisms underlying the synaptic 
alterations of Shank3 trasngenic mice, we set out to identify the in vivo 
interactors of Shank3. We used the EGFP-tag of the Shank3 transgene 
to isolate Shank3 protein complexes (Extended Data Fig. 5a). Immuno- 
precipitation followed by mass-spectrometry analysis identified 273 
proteins that bind Shank3 in vivo (Supplementary Table 4). There 
was notable overlap between the proteins on this list, those from our 
previous yeast two-hybrid screening”, and known mouse or human 
PSD proteins**”” (Extended Data Fig. 5b). We combined our two 
Shank3-focused experimental lists (previous yeast two-hybrid screen- 
ing’ and current in vivo immunoprecipitation, for a total of 404 proteins 
called the Shank3 interactome) for further analysis (Extended Data 
Fig. 5c, d). 

We performed Gene Ontology (GO) and pathway analysis, which 
suggested that regulation of actin cytoskeleton is a principal role of the 
Shank3 interacting proteins (Extended Data Fig. 5e and Supplemen- 
tary Table 5). Based on this analysis, we proposed that abnormal 
regulation of the actin cytoskeleton could be one major molecular 
change in the synapses of transgenic mice. Accordingly, we evaluated 
the effects of a 50% increase in Shank3 on F-actin levels. We found 
that F-actin levels of excitatory synapses were increased by ~30% in 
cultured hippocampal pyramidal neurons from transgenic mice com- 
pared to wild-type mice (Fig. 5a), which was restored by expression of 
Shank3 small interfering RNA (siRNA) (Extended Data Fig. 6a, b). 
Next, we used our Shank3 interactome to generate a sub-network of 
actin-related proteins (Fig. 5b) to focus the search on molecular med- 
iators of the elevated F-actin. This network comprises both known 
and previously unknown Shank3 interactors, which we confirmed by 
western blotting (Extended Data Fig. 5f). The most interesting new 
Shank3 direct interactors we identified (based on both yeast two- 
hybrid screening and in vivo immunoprecipitation) were two subu- 
nits of the Arp2/3 complex (ARPC2 and ARPCSL) (Fig. 5b). Arp2/3 
complex consists of seven-subunits and binds directly to the F-actin 
mother filament to initiate nucleation and branching’*. Consistent 
with the interaction analysis, ARPC2 clusters co-localized with EGFP- 
Shank3 and significantly enlarged in transgenic neurons (Extended 
Data Fig. 5g and Fig. 5c); the latter observation was reversed by Shank3 
siRNA (Extended Data Fig. 6c). To initiate actin polymerization, Arp2/3 
complex must be activated by nucleation-promoting factors (NPFs)”*. 
Given that the NPFs WASF1 and cortactin (CTTN) directly interact 
with Shank3 (refs 29, 30), Shank3 could function as a scaffold that 
brings WASF1, cortactin and Arp2/3 together, facilitating the forma- 
tion of F-actin structures (Fig. 5d). Consistent with our hypothesis, co- 
localization of WASF1 and ARPC2 was enhanced in transgenic neurons 
(Fig. 5e). Furthermore, the density of dendritic spines was increased in 
transgenic brains compared to wild-type brains (Fig. 5f). 

Our analysis indicated that hippocampal pyramidal neurons of trans- 
genic mice have a reduced number of inhibitory synapses (Fig. 4b, c), 
which was unexpected based on the exclusive localization of EGFP- 
Shank3 in excitatory synapses (Fig. 1c). To understand the underlying 
mechanism, we first investigated inhibitory neurons, changes of which 
might secondarily affect inhibitory synapses on pyramidal neurons. 
However, we found that there was no difference in synapse number on 
the GAD-6-positive inhibitory neurons of transgenic mice, although 
these neurons express EGFP-Shank3 (Extended Data Fig. 7). 
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Figure 4 | Abnormal EEG and altered synaptic excitatory/inhibitory 
balance of Shank3 transgenic mice. a, Representative EEG traces from wild- 
type (n = 5) and transgenic (n = 10) mice. Transgenic mice showed prolonged 
hyperexcitability discharges and electrographic seizure in all recorded regions. 
The frequency of epileptiform spikes (dentate gyrus) and electrographic seizure 
were significantly increased in transgenic mice. b, Increased VGLUT1-positive 
PSD-95 (n = 28, three independent experiments) and decreased VGAT- 
positive gephyrin (m = 25) puncta density in cultured hippocampal pyramidal 


We next explored a possible cell-autonomous relationship between 
excitatory and inhibitory synapses of transgenic pyramidal neurons. 
We speculated that increased clustering of actin-related proteins in 
excitatory synapses might reduce some of those proteins in inhibitory 
synapses, thereby contributing to the decrease of inhibitory synapse 
number. We focused on actin-related proteins, because the actin cyto- 
skeleton defines the molecular architecture of both E/I synapses*’. We 
picked Mena (also known as Enah) and profilin as candidates for four 
reasons: they directly interact with gephyrin in inhibitory synapses”; 
profilinl and profilin2 are recruited to excitatory synapses by neur- 
onal activity’’**; profilin2 was also identified in our Shank3 in vivo 
immunoprecipitation (Supplementary Table 4); and Mena and pro- 
filin interact with proteins in the Shank3 interactome (Extended Data 
Fig. 8). Both proteins showed decreased localization to inhibitory 
synapses, and profilin2 showed enhanced localization to excitatory 
synapses in transgenic neurons (Fig. 5g-i). Together, these results 
suggest that increased expression of Shank3 enhances F-actin levels 
through Arp2/3 complex in excitatory synapses of transgenic neu- 
rons. By modulating the synaptic distributions of Mena and profilin2, 
this contributes, at least in part, to changes in inhibitory synapses. 


Shank3 transgenic mice respond to valproate 

Having established that Shank3 overexpression causes mania-like 
behaviour in mice and a hyperkinetic syndrome in humans, we tested 
whether mood-stabilizing drugs for human mania could rescue the 
behavioural abnormalities of transgenic mice. First, we examined the 
effect of lithium, the first Food and Drug Administration (FDA)- 
approved treatment for bipolar disorder, which inhibits multiple targets 
including inositol monophosphatase and glycogen synthase kinase 3 
(GSK3)°°. The basal activity of GSK-3 in transgenic mouse hippocam- 
pus and striatum was similar to that of wild-type mice (Fig. 6a). Mice 


neurons from transgenic mice. Scale bar, 10 um. c, Frequency, but not 
amplitude and decay, of mIPSC was decreased in CA1 pyramidal neurons of 
transgenic mice (wild type n = 14, transgenic n = 18). d, Amplitude, but not 
frequency and decay, of sEPSC was increased in transgenic neurons (n = 19). 
e, AMPA/NMDA ratio was normal in transgenic neurons (n = 9). All data are 
presented as mean + s.e.m. *P < 0.05; **P < 0.01. Statistical analyses for these 
data are in Supplementary Table 2 and 3. 


were fed with lithium carbonate-containing chow for three weeks, 
resulting in serum lithium levels (wild type: 0.82 + 0.04; transgenic: 
0.91 + 0.08 mmol1~') comparable to those in the therapeutic range 
for humans (0.6~1.2 mmol] '). Lithium did not mitigate any of the 
manic-like behaviours of Shank3 transgenic mice (Fig. 6b and Extended 
Data Fig. 9). 

Given that our mice display seizures in addition to manic-like beha- 
viour, we decided to test valproate, another FDA-approved mood- 
stabilizing drug used to treat manic or mixed episodes that also acts 
as an anticonvulsant, increases GABA neurotransmission and inhibits 
sodium channels, calcium channels and histone deacetylase 1 (ref. 36). 
We treated the transgenic mice with acute intraperitoneal injections 
of either valproate (200 mg per kg) or saline and subjected them to 
behavioural testing. Valproate reversed the baseline hyperactivity, the 
amphetamine-hypersensitivity (Fig. 6c, d), and the abnormal acoustic 
startle response and PPI (Fig. 6e-h) in transgenic mice. The behaviour 
of wild-type mice, except for a decrease in baseline activity and an 
increase of PPI in male mice, was not significantly affected by valproate 
treatment (Fig. 6c—h). Valproate also significantly decreased the fre- 
quency of epileptiform spikes in transgenic mice as measured by EEG 
(Fig. 6i). Valproate treatment changed neither the levels of synaptic 
proteins in hippocampus or striatum of wild-type and transgenic mice, 
nor F-actin levels in cultured hippocampal neurons of transgenic mice 
(Extended Data Fig. 10). 


Discussion 


There are many genomic loci for which both deletions and duplica- 
tions cause human neuropsychiatric disorders, indicating that proper 
dosage of some genes is critical for normal brain function*”**. In most 
cases, however, the genes causing the respective phenotypes are poorly 
defined, making it hard to understand pathogenesis and to develop 
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targeted therapies. Chromosome 22q13 spanning SHANK3 is one 
example. Although SHANK3 loss has been shown to cause ASD in 
humans, SHANK3 overexpression has never been definitively deter- 
mined to cause neuropsychiatric disease. In this study, we generated 
Shank3 transgenic mice that model a human duplication and identified 
patients with the smallest 22q13 duplications involving SHANK3. The 
remarkable similarity of the neurobehavioural phenotypes between the 
mouse model and patients with SHANK3 duplications supports the notion 
that SHANK3 overexpression causes a hyperkinetic neuropsychiatric 
disorder that approximates mania. Indeed, either a 50% reduction or 
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Figure 5 | Shank3 interacts directly with Arp2/3 
complex to increase F-actin levels in Shank3 
transgenic mice. a, Increased synaptic F-actin in 
cultured hippocampal pyramidal neurons from 
transgenic mice (n = 29, three independent 
experiments). Scale bar, 10 um. b, Actin 
cytoskeleton-related sub-network of Shank3 
interactome. Y2H, yeast two-hybrid. c, Increased 
ARPC2 cluster size in transgenic pyramidal 
neurons (n = 20). d, Diagram shows the proposed 
role of Shank3 as a platform for F-actin regulating 
proteins. e, Increased co-localization of ARPC2 
and WASF1 in transgenic pyramidal neurons 

(n = 22). f, Golgi-staining of CA1 pyramidal 
neurons shows more dendritic spines in transgenic 
mice (n = 40 neurons from 3 animals per 
genotype). g, h, Excitatory and inhibitory synaptic 
distributions of Mena (g) and profilin2 (h) in 
cultured CA1 pyramidal neurons. Yellow 
arrowheads indicate protein puncta co-localized 
with corresponding synaptic markers. 

i, Quantification of (g) and (h). Density of VGAT- 
positive Mena puncta is decreased in transgenic 
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50% increase in Shank3 levels results in marked neuropsychiatric phe- 
notypes and a predisposition to epilepsy. 

Notably, the manic-like behaviour of Shank3 transgenic mice was 
reversed selectively by valproate but not by lithium treatment. It is not 
uncommon to find human patients with bipolar disorder that are 
resistant to lithium monotherapy”. Indeed, a subset of individuals with 
the rapid-cycling form of bipolar disorder is highly resistant to lithium”. 
As GSK3 is considered the major target for lithium’s efficacy in the 
treatment of BPD*’, the lithium-resistance of Shank3 transgenic mice 
and their normal GSK-3f activity are congruent with a mechanism 
independent of GSK-3 and possibly more dependent upon synaptic 
alterations, including E/I imbalance. 

Synaptic E/I imbalance is likely to underlie the pathogenesis for 
seizures**** and a broad spectrum of neuropsychiatric disorders'*’. 
We found morphological and functional changes in E/I synapses of 
transgenic neurons favouring excitation, which might account for both 
seizures and hyperkinetic behaviour. Consistent with this, GABAergic 
dysfunction in patients with bipolar disorder has been reported“, and 
valproate (which increases GABA neurotransmission and decreases 
high-frequency action potential firing) rescued the manic-like beha- 
viours of transgenic mice. In contrast, lithium enhances excitatory 
postsynaptic potentials in CA1 synapses**, which might explain the resis- 
tance of Shank3 transgenic mice to lithium treatment. Mechanistically, 


Figure 6 | Valproate, but not lithium, rescues manic-like behaviours of 
Shank3 transgenic mice. a, Basal activities of GSK-3B and Akt in the 
hippocampus and striatum of transgenic mice (10-week-old, n = 7) are normal. 
b, Amphetamine-sensitivity of male transgenic mice was not rescued by 
lithium. NS, not significant. c, d, Basal locomotor activity and amphetamine- 
sensitivity of transgenic mice were rescued by valproate. Basal activity of wild- 
type male mice was decreased by valproate during one 5-min time bin. 

e-h Acoustic startle response (e, g) and PPI (f, h) of transgenic mice were 
rescued by valproate. Valproate increased PPI of wild-type male mice. i, Rescue 
of abnormal EEG in transgenic mice by valproate. During the three consecutive 
days of tests, EEG was recorded from the three brain regions of transgenic mice 
(n = 7) for one hour before and after valproate injection. Representative EEG 
traces measured on day 1 are shown. The number of epileptiform spikes was 
quantified and normalized to the baseline values before treatment of day 1. All 
data are presented as mean + s.e.m. *P < 0.05; **P < 0.01; ***P < 0.001. 
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enhanced F-actin polymerization through Arp2/3 complex could be 
responsible for the synaptic changes of transgenic neurons, but other 
Shank3-dependent molecular pathways cannot be excluded. 

In conclusion, our study advances our understanding of the human 
diseases associated with SHANK3 and the mechanisms that can lead 
to bipolar disorder. We propose two distinct mechanisms leading to 
bipolar disorder: one is dysfunction of the well-established GSK-3 
pathway that is responsive to lithium treatment; the second is a syn- 
aptic dysfunction associated with an E/I balance shifted towards excita- 
tion, which could result in hyperkinetic behaviours, and which is resistant 
to lithium but sensitive to valproate. We believe this sort of pharma- 
cogenetic analysis in model organisms could improve treatment strat- 
egies for patients with bipolar disorder as the genetic underpinnings of 
each patient’s illness are elucidated. 


METHODS SUMMARY 


BAC clone (RP23-278D8) containing entire mouse Shank3 was modified by recom- 
bineering techniques. The linearized segment including EGFP-Shank3 plus ~12 kb 
in the 5’ direction and ~2kb in the 3’ direction was injected into the FVB/N 
embryos to generate transgenic mice. Behavioural, electrophysiological, biochem- 
ical and cell biological characterization of the mice were performed and analysed 
blind to genotypes. All experimental procedures were reviewed and approved by 
the Institutional Animal Care and Use Committee for Baylor College of Medicine. 
The details of experimental procedures, reagents and bioinformatic analyses are 
described in the Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Generation of Shank3 transgenic mice. To generate Shank3 transgenic mice, we 
used a BAC clone (RP23-278D8) containing a segment of mouse chromosome 15. 
This BAC clone was modified by recombineering techniques" to insert Kozak 
sequence (ACCATGG) followed by EGFP sequence (cloned from pEGFP-C1) at 
the first start codon of Shank3 gene (exon 1). The modified BAC clone was double 
digested with NotI/Swal, and the ~75 kb linearized segment with entire Shank3 
gene plus ~12 kb 5’ and ~2 kb 3’ was injected into the FVB/N embryos (Extended 
Data Fig. la). The primers used for genotyping were designed against EGFP 
sequence (forward: 5'-ATGGTGAGCAAGGGCGAGGAG-3’, reverse: 5'-GCGG 
ACTTGAAGAAGTCGTGC-3’). All procedures to maintain and use these mice 
were approved by the Institutional Animal Care and Use Committee for Baylor 
College of Medicine. 

Primary antibodies. Antibodies used for the western blots and/or immunostainings 
are Akt (Cell Signaling Technology; 4691), phospho-Akt (Thr 308) (Cell Signaling 
Technology; 2965), ARPC2 (Millipore; 07-227), B-actin (Abcam; ab20272), CYFIP2 
(Abcam; ab95969), GABAaR B3 (NeuroMab; 75-149), GAD-6 (Developmental 
Studies Hybridoma Bank), Gephyrin (Synaptic Systems; 147 111), GFP (Abcam; 
ab290, ab13970), GKAP (NeuroMab; 75-156), GluR2 (NeuroMab; 75-002), GSK- 
30/8 (Millipore; 05-412), phospho-GSK-3B (Ser 9) (Cell Signaling Technology; 
9336), Homer (Santa Cruz; sc-15321), MAP2 (Sigma; M9942), Mena/Vasp (Millipore; 
MAB2635), mGluRS5 (Millipore; AB5675), NR1 (Millipore; MAB363), Profilin2 
(TU-Braunschweig; 4H5), PSD-95 (NeuroMab; 75-028), pan-Shank (3856; (ref. 47)), 
Shank1 (NeuroMab; 75-064), Shank2 (NeuroMab; 75-088), Shank3 (Santa Cruz; 
H-160), VGAT (Synaptic Systems; 131 002), VGlut1 (Synaptic Systems; 135 302) 
and WASF1 (NeuroMab; 75-048). 

Preparation of brain lysates and western blot. S2 (soluble fraction) and P2 
(crude synaptosomal fraction) subcellular, and PSD fractions (synaptic fraction 
after one time Triton X-100 washout) were prepared as described**. For Akt and 
GSK3 detection, brains were dissected and homogenized in ice-cold lysis buffer 
(20 mM Tris-HCl, pH 7.4, 150mM NaCl, 2mM EDTA, 1% Triton X-100, 10% 
glycerol with protease and phosphatase inhibitor (Roche)). Western blot images 
were acquired by LAS 4000 (GE Healthcare) and quantified by an ImageJ software 
package. 

Hippocampal neuron cultures, transfection and immunostaining. Hippo- 
campal neurons were prepared from postnatal day 0-1 FVB/N mice and plated 
on poly-p-lysine/mouse laminin coated coverslips (BD Biosciences) in Neurobasal 
medium supplemented with GlutaMAX-I (Invitrogen), B-27 and 1% FBS. At days 
in vitro (DIV) 7, neurons were transfected with pEGFP-C1 plus either control or 
Shank3 siRNA (Ambion, s81603 or s81605). At DIV 14, neurons were fixed with 
4% formaldehyde/4% sucrose in PBS, and permeabilized with 0.2% Triton X-100 
in PBS. PBS with 0.1% BSA and 3% horse serum was used for blocking and 
antibody incubation. Alexa-conjugated phalloidin (Invitrogen) was used to visu- 
alize F-actin. Z-stack images were acquired by LSM710 (Zeiss) confocal micro- 
scope under the same parameter settings and the images were quantified (in a 
blinded manner) using ImageJ. 

RNA in situ hybridization. Hybridization probe specific for EGFP sequence was 
prepared by PCR amplification from the pEGFP-C1 using the following primers 
(forward: 5'‘-ATGGTGAGCAAGGGCGAGGAG-3’, reverse: 5’-CTTGTACAG 
CTCGTCCATGCC-3'). Riboprobe was generated using DIG RNA labelling mix 
(Roche). 

Golgi staining. Standard Golgi-Cox impregnation using the FD Rapid GolgiStain 
kit (NeuroTechnologies) was performed with brains from 10-week-old FVB/N 
male mice. Serial sagittal sections (50 lum) were collected and images of dendritic 
spines on the secondary branches (apical dendrites of CA1 pyramidal neurons) 
were acquired by LSM710 (Zeiss) confocal microscope under differential inter- 
ference contrast (DIC) mode. The images were quantified (in blinded manner) 
using Image]. 

Cresylviolet staining. Brains from 10-week-old FVB/N male mice were cryosec- 
tioned (25 tm) and stained with 0.2% cresyl violet. 

Drug treatment. For each experiment, individual was blinded to genotype and 
littermates were randomly distributed to control and treatment groups as follows; 
Animal with genotype A was assigned to control, next animal of genotype A to 
drug, third to control. Same with genotype B. Amphetamine and valproate 
(Sigma) were dissolved in normal saline to final concentration of 0.2 g1~' and 
20 gl”, respectively. Mice received intraperitoneal injection of amphetamine 
(2 mg per kg), valproate (200 mg per kg) or saline in a volume of 10 ml kg’. 
Valproate was injected three times before each behavioural assay (10:00 and 17:00 
on the day prior to the assay and 9:00 of the day of assay), and assays were 
performed 30 min after final injection. For lithium treatment, mice were fed 
either lithium carbonate-containing chow or control chow (Harlan Teklad). 
The lithium group was initially fed 0.2% lithium carbonate chow for a week 
followed by 0.4% chow for two more weeks before the behavioural assays. 


Water with 0.85% sodium chloride was provided to counteract toxicity of lithium. 
Lithium concentration in serum was measured using lithium assay kit (Crystal 
Chem) according to the manufacturer’s instruction. 

Behavioural assays. All data acquisition and analyses were carried out by an 
individual blinded to the genotype. For behavioural assays, 2-3-month-old mice 
(both males and females) of the Fl hybrid (FVB/N X C57BL/6J) were used to 
mitigate strain specific effects. A sample size of mice was chosen to mitigate 
genetic background variance. Before each test, mice were habituated in the test 
room at least for 30 min. Mice with sign of seizures (before, during or after test) 
were excluded from the analysis. The summary of statistical analysis for beha- 
vioural assays is provided in Supplementary Table 1. 

Open field. After habituated in the test room (600 Ix, 60 dB white noise), mice 
were placed in the centre of a clear, open Plexiglass chamber (40 X 40 X 30 cm), 
and the activities were measured by photobeam breaks (Accuscan) for 30 min. To 
measure amphetamine response, mice received intraperitoneal injection of saline 
or amphetamine after 30 min test of basal activities, and the activities were mea- 
sured for additional 60 min. 

Home-cage activity. Single-caged mice were habituated in the cage for 12 h, and 
the activities were measured by photobeam breaks (Accuscan) for 48 h under the 
light cycle of 07:00-19:00. 

Tail suspension. After habituation period in the test room (600 Ix, 60 dB white 
noise), mice were suspended by their tails. Movements were recorded and the 
immobile time was automatically measured by ANY-maze software (Stoelting). 
The same parameter setting for the definition of immobility was applied for all the 
mice tested. 

Acoustic startle response and prepulse inhibition. Mice were placed in a test 
chamber (San Diego Instruments) and habituated for 5 min with 70 dB back- 
ground white noise. Eight trial types (no stimulus, a 40 ms 120 dB sound as the 
startle stimulus, and three different prepulse sounds (20 ms 74, 78, 82 dB) either 
alone or 100 ms before the startle stimulus) were presented in pseudo-random 
order with six times per each trial type. The interval between each trial type was 10 
to 20s. The maximum startle amplitude during the 65 ms period following the 
onset of the startle stimulus was used to calculate percentage prepulse inhibition. 
Circadian rhythms. Mice were individually housed in cages equipped with a 
11.5cm diameter wheel, and wheel-running activity was recorded by VitalView 
software (Respironics Mini-Mitter system). Mice were held on a 12:12 light/dark 
cycle for 8 to 14 days with food, water and temperature at 20 + 2 °C. Mice were 
then released into constant dark for 14 days. 

Three chamber test. The test was performed as previously described”. The three 
chamber apparatus is a clear Plexiglass box (24.75 X 16.75 X 8.75 inches) with 
removable partitions separating the box into left, centre and right chambers. The 
age- and gender-matched C57BL/6] mice were used as novel partners. Two days 
before the test, the novel partner mice were habituated to the wire cages (3 inches 
diameter by 4 inches in height) for 1h per day. The wire cage with inanimate 
novel object inside served as a control. The test mouse was habituated in the 
chamber for 15 min as described”. During the habituation period, neither wild- 
type nor Shank3 transgenic mice showed a preference for either side of the 
chamber (data not shown). After the habituation period, the novel partner mouse 
was placed into one of the wire cages and located randomly in either the right or 
left side of chamber. The novel object was placed into the wire cage located in the 
other side of chamber. The test mouse was allowed to explore the three chamber 
apparatus for 15min. The movements were recorded and the total amount of 
time spent in each chamber was automatically measured by ANY-maze software 
(Stoelting). The close interaction time, defined by rearing, sniffing or pawing at 
each wire cage, was measured manually. 

Grooming. After habituation period in the test room (600 lx, 60 dB white noise), 
mice were placed into the centre of a Plexiglass cylinder (20 cm in diameter by 
30 cm in height) and videotaped for 10 min. The amount of time spent grooming 
was measured from the videotape. 

Ultrasonic vocalization. Separation-induced ultrasonic vocalizations (USV) 
were measured on pups of postnatal day 6 to 13. A standard housing cage contain- 
ing one female and her offspring was transferred to a holding room and allowed to 
acclimate for 30 min. Following this acclimation period, the cage was transferred 
to a testing room. Pups were placed into a clean and warm (35 °C) cage and allowed 
to acclimate for 5min. The mother was removed from the testing room and 
transferred back to the holding room. After 5 min acclimation period, one pup 
will be placed individually into a plastic beaker in a testing chamber. A bat detector 
positioned 10 cm above the beaker transduced ultrasonic signals. USV was quan- 
tified for a total testing period of 2 min using sound analysis software (Ultravox, 
Noldus Information Technology). 

EEG measurement. All data acquisition and analyses were carried out blinded to 
genotype. FVB/N 2-3-month-old male mice were anaesthetized with isoflurane. 
Under aseptic conditions, each mouse was surgically implanted with tungsten 
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electrodes (501m diameter) aimed at hippocampal CA1 (P2.0R1.2H1.3) and 
dentate (P2.0R1.8H1.8) regions. The third EEG electrode made of silver wire 
(127 um diameter) was implanted in the subdural space of the right frontal cortex 
area. A reference/grounding electrode was then positioned in the occipital region 
of the skull. All electrode wires were attached to a miniature connector (Harwin 
Connector). After 3 to 5 days of post-surgical recovery, EEG activities (filtered 
between 0.1 Hz and 5 kHz, sampled at 10 kHz) were recorded for 1 h per day over 
3 to 5 days. In the experiments evaluating the effects of valproate on EEG activities, 
adult male Shank3 transgenic mice with implanted electrode went through record- 
ing sessions for 3 consecutive days. On each day, animals received one hour 
baseline recording before valproate (200 mg per kg, ip.) followed by another hour 
of post recording. 

EEG data analysis. Electrographic seizure time lasting over 10s was counted 
manually and normalized as the percentage of the total recording period. The 
number of abnormal epileptiform spikes were counted using Clampfit 10 soft- 
ware (Molecular Devices) when the sharp positive deflections exceeding twice the 
baseline and lasting 25-100 ms*’. In the pharmacological tests, spike numbers 
were normalized to the baseline value before drug administration on day 1. 
Whole-cell patch-clamp recordings. All data acquisition and analyses were car- 
ried out blinded to genotype. Acute fresh hippocampal slices were prepared from 
FVB/N male mice at the age of 6-8-weeks old as previously described*'. Coronal 
slices (250-m thick) containing medial hippocampus were cut with a vibratome 
(Leica Microsystems) in a chamber filled with chilled (2-5 °C) cutting solution 
containing (in mM) 110 choline-chloride, 25 NaHCO3, 25 D-glucose, 11.6 
sodium ascorbate, 7 MgSOx, 3.1 sodium pyruvate, 2.5 KCl, 1.25 NaH2PO, and 
0.5 CaCl). The slices were then incubated in artificial cerebrospinal fluid (ACSF, 
in mM) containing 119 NaCl, 26.2 NaHCO3, 11 D-glucose, 3 KCl, 2 CaCh, 
1 MgSOy, 1.25 NaH,PO, at the room temperature. The solutions were bubbled 
with 95% O2 and 5% CO2. Whole-cell recording was made using patchclamp 
amplifiers (Multiclamp 700B) under infrared-differential interference contrast 
microscopy (Zeiss). Data acquisition and analysis were performed using digitizers 
(DigiData 1440A) and analysis software pClamp 10 (Molecular Devices). Signals 
were filtered at 2 kHz and sampled at 10 kHz. Spontaneous EPSCs (sEPSCs), 
miniature EPSCs (mEPSCs) and IPSCs (mIPSCs) were recorded from CA1 hip- 
pocampal pyramidal neurons at —70 mV in voltage-clamp mode. sEPSCs were 
recorded in the presence of GABA, receptor blocker, SR95531 (10 uM). mEPSCs 
were recorded in the presence of tetrodotoxin (TTX, 0.54M) and SR95531 
(10 1M). Glass pipettes with a resistance of 2.5-5 MQ were filled with a solution 
containing (in mM) 140 potassium gluconate, 5 KCl, 10 HEPES, 0.2 EGTA, 2 
MgCl, 4 MgATP, 0.3 Na,GTP and 10 Na)-phosphocreatine, pH 7.2 (with KOH). 
For recording mIPSCs, potassium gluconate was replaced with KCl (140 1M) in 
the pipette solution. Besides TTX (0.5M), glutamate receptor antagonists 
6-cyano-7-nitroquinoxaline-2,3-dione disodium (CNQX) (101M) and D-2- 
amino-5-phosphonovaleric acid (D-AP-5) (25 uM) were also added to the bath. 
Series resistance (15-30 MQ) and input resistance (100-200 MQ) were moni- 
tored throughout the whole-cell recording. Evoked EPSCs were recorded in the 
presence of SR95531 (10M). To stimulate presynaptic inputs, stimulation 
pulses (50 is duration at 0.1 Hz) were applied to a bipolar tungsten stimulation 
electrode (WPI) placed along the Schaffer collaterals. Extracellular stimuli were 
administered using a stimulator system (Master 8). The stimulating and record- 
ing pipettes were placed at the same depth in the slice and the distance between 
them was kept constant (~300 jim). Data were discarded when the change in the 
series resistance was >20% during the course of the experiment. The whole-cell 
recording was performed at 30 + 1 °C with the help of an automatic temperature 
controller (Warner Instruments). For measurements of the AMPA-to-NMDA 
receptor ratio, the CA1 pyramidal neuron was voltage-clamped at 40 mV. First, a 
stable baseline recording of total EPSCs was obtained. The NMDA receptor 
antagonist D-AP-5 (50 mM) was then applied to the bath for 5-10 min to isolate 
fast AMPA-receptor-mediated EPSCs. Digital subtraction of AMPA-receptor- 
EPSCs from the total EPSCs from the same neuron yielded NMDA-receptor- 
EPSCs. An average of 12-20 EPSCs were collected for each type of EPSC. The 
bath solution contained picrotoxin (100M). The intracellular solution con- 
tained 140mM CsCH3SO3, 10mM HEPES, 1 mM EGTA, 5mM TEA-Cl, 2mM 
MgCl, 2.5mM MgATP and 0.3mM GTP, pH 7.2-7.4 (with CsOH). 
Extracellular field recordings. All data acquisition and analyses were carried out 
blinded to genotype. One- to two-month-old FVB/N male mice were anaesthe- 
tized with isoflurane and their brains were removed and immersed immediately 
in ice-cold cutting solution (in mM): 110 sucrose, 3 KCl, 0.5 CaCh, 60 NaCh, 7 
MgCh, 1.25 NaH,PO4, 28 NaHCOs, 5 D-glucose saturated with 95% O, and 5% 
COs. 400-pm thick transverse hippocampal slices were prepared with a vibrating 
microtome Series 1000 (Vibratome) and recovered at 31 + 0.5 °C for an hour in 
an interface chamber. Field excitatory postsynaptic potential (fEPSP) were recorded 
at 31+0.5°C in an interface chamber perfused at 1ml min ' with artificial 


ARTICLE 


cerebrospinal fluid (ACSF) containing (in mM): 125 NaCl, 2.5 KCl, 2 CaCh, 1 
MgCl, 1.25 NaH,PO,, 25 NaHCOs, and 15 p-glucose. Extracellular stimuli were 
administered along the Schaffer collaterals using Formvar-insulated, bipolar 
nichrome electrodes controlled by a stimulus isolator (A-M Systems). An ACSF- 
filled glass-recording electrode was placed in stratum radiatum to record the field 
potential changes. Electrophysiological traces were amplified with AC-coupled 
amplifier (model 1800; A-M Systems), digitized using a Digidata 1320A (Molecular 
Devices), and acquired with pClamp 10 software (Molecular Devices). To assess 
baseline synaptic transmission, input-output relationships were examined by 
measuring the rising slope of the fEPSP evoked by 100-s pulses over various 
stimulus intensities (1 V to 10 V). The stimulation intensity that evoked a fEPSP 
whose slope was 30-40% of the maximum fEPSP slope, determined by the input- 
output recording experiment, was used for the following recording paradigms. 
Long-term potentiation (LTP) was induced by two trains of 100 stimuli pulses at 
100 Hz that are separated 20s apart. To monitor LTP development, the fEPSPs 
were recorded every 20s for 20 min before and 60 min after induction. The mag- 
nitude of potentiation was determined by measuring the changes in the slope of the 
fEPSP. NMDAR-dependent long-term depression (LTD) formation was induced 
with 900 paired stimuli with 50 ms inter-stimulus interval at 1 Hz frequency. In all 
experiments, genotypes were confirmed by PCR with tails cut. All data are shown 
as mean + s.e.m. 

In vivo immunoprecipitation and mass spectrometry. Striatum and hippocam- 
pus were dissected from 8 animals (5-weeks-old) per each genotype (wild-type and 
Shank3 transgenic FVB/N male mice), and crude synaptosomal fraction solubi- 
lized with deoxycholic acid (DOC) buffer was prepared as described previously”. 
Then ~10 mg of lysates were incubated with GFP-Trap beads (ChromoTek) for 
2h at 4°C. The beads were briefly washed with binding/dialysis buffer (50 mM 
Tris-HCl, pH 7.4, 0.1% Triton X-100) and boiled in with 1X NuPAGE LDS sample 
buffer (Invitrogen) to be loaded on SDS-PAGE (NuPAGE 4-12% Bis-Tris Gel, 
Invitrogen). The eluted proteins were visualized with Coomassie Brilliant blue 
stain and excised into 10 gel pieces according to molecular size. The individual 
gel pieces were destained and subject to in-gel digestion using trypsin. Tryptic 
peptide was dissolved in 10 pil of loading solution (5% methanol containing 0.1% 
formic acid) and subjected to nanoflow LC-MS/MS analysis with a nano-LC II 
(Thermo Scientific) coupled to LTQ Orbitrap Velos (Thermo Scientific) mass 
spectrometer. The peptides were loaded onto an in-house Reprosil-Pur Basic 
C18 (3 um, Dr. Maisch) trap column which was 2 cm X 75 tum size. Then the trap 
column was washed with loading solution and switched in-line with an in-house 
100mm X 75 um column packed with Reprosil-Pur Basic C18 equilibrated in 
0.1% formic acid/water. The peptides were separated with a 75 min discontinuous 
gradient of 5-28% acetonitrile/0.1% formic acid at a flow rate of 450nl min. 
Separated peptides were directly electro-sprayed into LTQ Orbitrap Velos mass 
spectrometer. The LTQ Orbitrap instrument was operated in the data-dependant 
mode acquiring fragmentation spectra of the top 50 strongest ions and under direct 
control of Xcalibur software (Thermo Scientific). Obtained MS/MS spectra were 
searched against target-decoy mouse refseq database in Proteome Discoverer 1.3 
interface (Thermo Fisher) with Mascot algorithm (Mascot 2.3, Matrix Science). 
The precursor mass tolerance was confined within 20 p.p.m. with fragment mass 
tolerance of 0.5 Da and a maximum of two missed cleavage allowed. Assigned 
peptides were filtered with 5% false discovery rate (FDR) and subject to manual 
verifications. Any protein whose peptide was identified from wild-type brain 
sample was excluded from the further analysis. 

Construction of Shank3 interactome network. To build an interaction network 
with the union set of Shank3 interactors (in vivo immunoprecipitation plus yeast 
two-hybrid screening), protein interactions were adopted from iRefIndex data- 
base (http://irefindex.org) which provides consolidated protein interactions from 
the 10 primary interaction databases (BIND, BioGRID, CORUM, DIP, HPRD, 
IntAct, MINT, MPact, MPPI and OPHID). Only the interactions among human 
and mouse genes were included. The direct interactions were distinguished from 
the indirect interactions based on experimental assays. Network graphics were 
generated with Cytoscape**. Nodes and edges were coloured based on the source 
and interaction type of the proteins, respectively. Thickness of edges for direct 
interactions correlates with the number of supportive evidences in the interaction 
database (Fig. 5b). To simplify the network, orphan nodes, defined by the nodes 
which have interaction only with Shank3, were excluded from the network 
(Extended Data Fig. 5c). 

Network topology analysis of Shank3 interactome. Shortest path length (the 
number of links of the shortest path travelling from one protein to another 
protein) was calculated to measure the network topology of Shank3 interactome. 
To test the significance, we randomly picked the same number of proteins from 
the mouse PSP” and calculated shortest path length of the interactome network 
generated with the proteins. This re-sampling was repeated 10,000 times and the 
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empirical P value (the number of sample networks whose mean path length is 
smaller than that of Shank3 interactome/10,000) was calculated. 

Gene ontology (GO) analysis. The GO analysis was carried out using DAVID 
software (version 6.7)*>**. The union set of Shank3 interactors from the in vivo 
immunoprecipitation and yeast two-hybrid screening was tested against a cus- 
tomized background gene list in which genes expressed at low levels in mouse 
brain were excluded. 

Human subjects. Both individuals were referred to the Medical Genetics Labo- 
ratories, Baylor College of Medicine, Houston, USA, for clinical array compara- 
tive genomic hybridization (aCGH) analysis. Following informed consent, approved 
by the Institutional Review Board for Human Subject Research at Baylor College of 
Medicine, the medical records of the respective individuals were reviewed. Addi- 
tional clinical information was obtained by phone interview with patient 1’s legal 
guardian and patient 2, respectively. 

Array comparative genomic hybridization. Patient DNA, isolated from peri- 
pheral whole blood using the Puregene DNA extraction kit (Gentra) following the 
manufacturer’s instructions, was analysed using the Baylor College of Medicine 
V8 OLIGO clinical genomic microarray, described”. Briefly, this is a custom- 
designed genomic microarray with both genome-wide coverage and supplement- 
ary exonic coverage of ~1,700 known or suspected disease genes, including 
SHANK3. 

Quantification and statistical analysis. For quantification, values from three 
independent experiments with at least three biological replicates were used. For 
behavioural assays, all population values appear normally distributed and the 
variance is similar between the groups. P values were calculated by Student’s 


t-test or analysis of variance with proper post-hoc tests (GraphPad Prism), as 
specified in each figure legend or Supplementary Tables 1-3. All data are pre- 
sented as mean + s.e.m. *P < 0.05; **P< 0.01; ***P< 0.001. 
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Extended Data Figure 1 | Characterization of expression patterns of 
EGFP-Shank3 and other synaptic proteins in Shank3 transgenic mice. 

a, Description of the Shank3 BAC used for transgenic mice generation 
(image was modified from UCSC genome browser). b, Western blot images 
show expression of EGFP-Shank3 in the synaptic fraction of Shank3 transgenic 
brain lysates. EGFP antibody recognized EGFP-Shank3 (~200 kDa) 
specifically in synaptic fractions of Shank3 transgenic mice. Asterisk indicates 
non-specific bands detected by the EGFP antibody. Shank3 antibody detected 
endogenous Shank3 plus EGFP-Shank3 (arrow) in the transgenic samples. S2, 
soluble fraction; P2, crude synaptosomal fraction; PSD I, synaptic fraction after 
one time Triton X-100 washout. c, The brain regional expression pattern of 


Seow 


me Hp mee St 


EGFP-Shank:3 is similar to that of endogenous Shank3. Cb, cerebellum; 

Ct, cortex; Hp, hippocampus; St, striatum; Th, thalamus. d, The brain 
developmental expression pattern of EGFP-Shank:3 is similar to that of 
endogenous Shank3. e, The fold changes of Shank3 proteins in 3-week-old 
transgenic mice (m = 4) are similar to those of 6-week-old mice (Fig. 1d). 

f, Female transgenic mice (n = 4) show similar fold changes of Shank3 proteins 
to male transgenic mice (Fig. 1d). g, h, Expression levels of excitatory (g) and 
inhibitory (h) synaptic proteins are not significantly altered, except Shank3, 
in the synaptosomal fraction of 8-week-old transgenic hippocampus and 
striatum (n = 6). All data are presented as mean + s.e.m. P values (*P < 0.05) 
were derived from unpaired, two-tailed Student’s t-test. 
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Extended Data Figure 2 | Behavioural characterization of Shank3 
transgenic mice. a, Transgenic mice showed increased locomotor speed 
during the 30 min open-field assay. b, Female transgenic mice showed 
increased home-cage activity. c, Transgenic mice have increased body weight 
compared to wild-type mice measured at 9-weeks-old (male wild type: 

28.1 + 0.9, male transgenic: 30.6 + 0.7, female wild type: 20.1 + 0.3, female 
transgenic: 21.4 + 0.4, n = 10-13; *P < 0.05; unpaired two-tailed Student’s 
t-test) and 15-weeks-old (male wild type: 32.4 + 1.2, male transgenic: 

37.0 + 1.3, female wild type: 23.6 + 0.4, female transgenic: 26.6 + 0.9, 

n= 10-13; *P < 0.05). d, Increased food intake by transgenic mice. Food intake 
was measured from single-caged female transgenic or wild-type mice from 6- to 
10-weeks-old. At 7-weeks-old (wild type: 3.51 + 0.05, transgenic: 3.69 + 0.06, 
n= 9-11; *P < 0.05; unpaired two-tailed Student’s t-test) and 9-weeks-old 
(wild type: 3.61 + 0.07, transgenic: 3.86 + 0.05, n = 9-11; *P < 0.05), food 


intake by transgenic mice was significantly higher than wild-type mice. e, f, In 
the 3-chamber assay, male (e) and female (f) transgenic mice did not show 
significant preference for novel mice to novel objects. g, Transgenic mice spent 
significantly less time in close interaction with novel social partners. 

h, i, Stereotypy time (h) and activity count (i) during the 30 min open-field test 
were normal in transgenic mice. j, Time spent in grooming during 10 min 
monitoring was normal in transgenic mice. k, Separation-induced ultrasonic 
vocalization was measured from wild-type and transgenic mice of postnatal 
day 6 to 13. Transgenic mice made less calls than wild type at postnatal day 13. 
1, At postnatal day 13, transgenic mice express more Shank3 proteins than 
wild-type mice. All data are presented as mean + s.e.m. *P < 0.05, **P < 0.01, 
***P <().001. The summary of statistical analyses for behavioural assays is 
provided in Supplementary Table 1. 
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Extended Data Figure 3 | Behavioural Phenotypes of Shank3 transgenic significantly decreased the levels of « isoform and total Shank3 protein. 
mice were rescued by crossing with Shank3B*'~ mice. a, Schematic diagram cc, d, Locomotor activities of male (c) and female (d) Shank3 transgenic mice 


shows possible genotypic combinations and their Shank3 expression levels were rescued by crossing with Shank3B*'~ mice. e, f, Immobile time in tail- 

(arrow) from the crossing between Shank3 transgenic mice and Shank3B*!~ suspension test of male (e) and female (f) Shank3 transgenic mice were rescued 

mice. b, Quantification of the levels of Shank3 proteins in each genotype by crossing with Shank3B*'~ mice. g, Prepulse inhibition (by 78 dB) of female 
S) 


(n = 4). The fold changes were compared to the wild-type (Shank3B* Shank3 transgenic mice was rescued by crossing with Shank3B*'~ mice. All 
controls. Crossing of Shank3 transgenic mice with Shank3B*’~ mice data are presented as mean + s.e.m. *P < 0.05; ***P <0.001. 
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Extended Data Figure 4 | Basal synaptic transmission and NMDA receptor- 
dependent synaptic plasticity are normal at the hippocampal Schaffer 
collateral-CA1 synapses of Shank3 transgenic mice. a, Images of cresylviolet 
staining show normal cytoarchitecture of transgenic brain. b, Normal paired- 
pulse facilitation ratio (PPR) of evoked EPSCs (eEPSCs) at transgenic Schaffer 
collateral-CA1 synapses. c, d, Normal synaptic input-output (I-O) relationship 
at transgenic Schaffer collateral-CA1 synapses. I-O curve (c) and cumulative 
curve of I-O slope (d) are shown. Red lines in c represent the fitting curves. 
e, Left, sample traces of action potentials triggered by 200 pA current injection 
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in hippocampal CA1 region of wild-type and transgenic mice. Middle and 
right, unaltered number of action potentials trigged by the injection of current 
at different level (middle) and amplitude of the first action potential triggered 
by 800-ms-long pulse of 200 pA current (right) show normal intrinsic 
excitability of hippocampal CA1 pyramidal neurons in transgenic mice. 

f, Amplitude, frequency and decay of mEPSC are not altered in CA1 pyramidal 
neurons of transgenic mice. g, h, NMDA receptor-dependent long-term 
potentiation (g) and long-term depression (h) at Schaffer collateral-CA1 
pyramidal synapses are normal in transgenic neurons. 
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Extended Data Figure 5 | Generation and characterization of Shank3 in line), supporting strong connectivity of Shank3 interactome. e, GO and KEGG 
vivo interactome. a, Isolation of EGFP-Shank3 protein and its interactors pathway analysis of Shank3 interactome reveal enrichment of actin 
from synaptosomal fraction of transgenic mice. b, Venn diagrams show cytoskeleton-related function/pathway. The full result of analysis is in 
overlaps among the protein lists from Shank3 in vivo immunoprecipitation, Supplementary Table 5. f, Confirmation of the in vivo interactions between 
Shank3 yeast two-hybrid screening, and either published mouse PSP Shank3 and actin-related proteins. g, In cultured hippocampal neurons, 
(postsynaptic proteome) or human PSD. c, Shank3 interactome network. ARPC2 proteins are co-localized with F-actin (upper panel) and EGFP-Shank3 


d, Average path length of Shank3 interactome (3.12, red line) is significantly (lower panel). 
(P < 0.0001) shorter than that of mouse PSP interactome (mean = 3.91, black 
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Extended Data Figure 6 | Shank3 siRNA reversed increased F-actin levels 


and ARPC2 cluster size in cultured hippocampal pyramidal neurons from 


Shank3 transgenic mice. a, Validation of the two siRNAs against Shank3. 


HEK293T cells were transfected with HA-Shank3 plus control or Shank3 
siRNA. EGFP plasmid was co-transfected as an internal control. After 48 h, 


expression levels of HA-Shank3 were measured by western blot and quantified 
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(n = 3). b, siRNA targeting Shank3 (si-Shank3) reversed the increased F-actin 
levels of Shank3 transgenic neurons. Image for si-Shank3 #2 is not shown. 

c, si-Shank3 reversed the increased ARPC2 cluster size of Shank3 transgenic 
neurons. All data are presented as mean =~ s.e.m. from 20-30 neurons per 
condition. P values (*P < 0.05, **P < 0.01) were derived from one-way 
ANOVA with post hoc Tukey’s multiple comparison. 
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Extended Data Figure 7 | Normal excitatory and inhibitory synapse 
numbers of GAD-6-positive inhibitory neurons of Shank3 transgenic mice. 
a, Expression of EGFP-Shank3 in GAD-6-positive inhibitory neurons of 
Shank3 transgenic mice. The yellow circles of dotted line and solid line indicate 
neuronal cell bodies of a GAD-6-negative excitatory neuron and a GAD-6- 
positive inhibitory neuron, respectively. The box 1 shows dendritic segment of 
the excitatory neuron and the box 2 shows that of the inhibitory neuron. 

b, Quantification of dendritic EGFP-Shank3 intensity in excitatory and 
inhibitory transgenic neurons. Inhibitory neurons express less EGFP-Shank3 
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(excitatory neurons: 1.00 + 0.06, inhibitory neurons: 0.74 + 0.09, n = 10; 

*P < 0.05; unpaired two-tailed Student’s t-test). c, Normal excitatory synapse 
number of GAD-6-positive inhibitory neurons of Shank3 transgenic mice. 

d, Normal inhibitory synapse number of GAD-6-positive inhibitory neurons of 
Shank3 transgenic mice. e, Quantification of (c) (wild type: 1.00 + 0.10, 
transgenic: 0.99 + 0.07, n = 18; P> 0.05; unpaired two-tailed Student’s t-test) 
and (d) (wild type: 1.00 + 0.12, transgenic: 0.95 + 0.09, n = 20; P > 0.05; 
unpaired two-tailed Student’s f-test). 


©2013 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


ACTNig, MYOSBe 


ACTN2@ _ABI2@ | PTPRS@ 


PINKS ACTA ___PEN2g 


| Ménal 
SHANK3q{— SORBS2@_ WASF 1, VASP _ GPHN@ 


BAIAP2¢” _ABIt}y) PENA 
ATNig é DNW2@ CRMP1gy 
VIM» Hes 
© Shank3 


@ Shank3 interactome 

@ Actin-related interactors of Gephyrin 
@ Gephyrin 

w=== Direct interaction 


Extended Data Figure 8 | Shank3 interactome is connected with gephyrin- 
interacting actin-related proteins, Mena, profilin] and profilin2. Based ona 
literature search, we selected candidate actin-related proteins (Mena/VASP, 
profilin! and profilin2) that directly interact with inhibitory postsynaptic 
protein gephyrin. To understand potential interactions among these proteins 
and the Shank3 interactome, we generated interaction network using the 
information from our yeast two-hybrid screening and [RefIndex PPI database 
(http://irefindex.org). The interaction network shows that Mena/VASP, 
profilin1 and profilin2 are connected with Shank3 interactome mainly through 


actin-related proteins. Note that one of the gephyrin-interacting proteins, 
profilin2, was also identified by our Shank3 in vivo immunoprecipitation 


(Supplementary Table 4). 
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Extended Data Figure 9 | Mania-like behaviours of Shank3 transgenic mice 
are resistant to lithium treatment. a, Basal and amphetamine-induced 
locomotor activities of female transgenic mice were not affected by lithium 
treatment. b-e, Abnormal acoustic startle response (b, d) and PPI (c, e) of male 
and female transgenic mice were not rescued by lithium treatment. 


f, g, Immobile time of male (f) and female (g) transgenic mice in tail-suspension 
test was not rescued by lithium treatment. Wild-type female mice showed 
decreased immobile time upon lithium treatment, which is an expected 
response to high dose lithium treatment in wild-type mice. All data are 
presented as mean + s.e.m. *P < 0.05; ***P < 0.001. 
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Extended Data Figure 10 | Valproate treatment does not affect synaptic images of Shank3 transgenic cultured hippocampal pyramidal neurons (DIV 
protein and F-actin levels in neurons of Shank3 transgenic mice. a,Onehour 14) treated with different concentrations of valproate (0, 0.01, 0.1 and 1 mM) 
after final valproate injection (200 mg per kg), synaptosomal fraction was for 30 or 60 min. c, Quantification of (b). There was no significant change in 


prepared from brains, and indicated proteins were detected by western blotting. _ F-actin levels by valproate treatment (nm = 16 per condition). All data are 
Neither wild-type nor transgenic mice showed significant change in the levels presented as mean + s.e.m. 
of synaptic proteins by valproate treatment ( = 7). b, Representative confocal 
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Context-dependent computation by 
recurrent dynamics in prefrontal cortex 


Valerio Mante!+*, David Sussillo**, Krishna V. Shenoy”? & William T. Newsome! 


Prefrontal cortex is thought to have a fundamental role in flexible, context-dependent behaviour, but the exact nature of 
the computations underlying this role remains largely unknown. In particular, individual prefrontal neurons often 
generate remarkably complex responses that defy deep understanding of their contribution to behaviour. Here we 
study prefrontal cortex activity in macaque monkeys trained to flexibly select and integrate noisy sensory inputs 
towards a choice. We find that the observed complexity and functional roles of single neurons are readily understood 
in the framework of a dynamical process unfolding at the level of the population. The population dynamics can be 
reproduced by a trained recurrent neural network, which suggests a previously unknown mechanism for selection 
and integration of task-relevant inputs. This mechanism indicates that selection and integration are two aspects of a 
single dynamical process unfolding within the same prefrontal circuits, and potentially provides a novel, general 
framework for understanding context-dependent computations. 


Our interactions with the world are inherently flexible. Identical sensory 
stimuli, for example, can lead to very different behavioural responses 
depending on ‘context’, which includes goals, previous expectations 
about upcoming events, and relevant past experiences’*. Animals can 
switch rapidly between behavioural contexts, implying the existence of 
rapid modulation, or ‘gating’, mechanisms within the brain that select 
relevant sensory information for decision-making and action. A large 
attention literature suggests that relevant information is selected by 
top-down modulation of neural activity in early sensory areas**, which 
may take the form of modulation of firing rates**’, or modulation of 
response synchrony within or across areas***. The top-down signals 
underlying such ‘early’ modulations of sensory activity arise, in part, 
from prefrontal cortex (PFC)*”, which is known to contribute to repre- 
senting and maintaining contextual knowledge, ignoring irrelevant 
information, and suppressing inappropriate actions’*”°. These obser- 
vations have led to the hypothesis that early selection may account for 
the larger effect of relevant as compared to irrelevant sensory informa- 
tion on contextually sensitive behaviour. 

Here we test this hypothesis with a task requiring context-dependent 
selection and integration of visual stimuli. We trained two macaque 
monkeys (A and F) to perform two different perceptual discrimina- 
tions on the same set of visual stimuli (Fig. 1). The monkeys were 
instructed by a contextual cue to either discriminate the direction of 
motion or the colour of a random-dot display, and to report their 
choices with a saccade to one of two visual targets (Fig. 1a). While 
monkeys performed this task, we recorded extracellular responses from 
neurons in and around the frontal eye field (Extended Data Fig. 1a, f), 
an area of PFC involved in the selection and execution of saccadic eye 
movements’””’, the control of visuo-spatial attention’’, and the integ- 
ration of information towards visuomotor decisions'’*™*. 

We found no evidence that irrelevant sensory inputs are gated, or 
filtered out, before the integration stage in PFC, as would be expected 
from early selection mechanisms’ *. Instead, the relevant input seems 
to be selected late, by the same PFC circuitry that integrates sensory 
evidence towards a choice. Selection within PFC without previous 


gating is possible because the representations of the inputs, and of 
the upcoming choice, are separable at the population level, even though 
they are deeply entwined at the single neuron level. An appropriately 
trained recurrent neural network model reproduces key physiological 
observations and suggests a new mechanism of input selection and inte- 
gration. The mechanism reflects just two learned features of a dyna- 
mical system: an approximate line attractor and a ‘selection vector’, 
which are only defined at the level of the population. The model mech- 
anism is readily scalable to large numbers of inputs, indicating a gen- 
eral solution to the problem of context-dependent computation. 


Behaviour and single-unit responses 


The monkeys successfully discriminated the relevant sensory evidence 
in each context, while largely ignoring the irrelevant evidence (Fig. 1c-f, 
monkey A; Extended Data Fig. 2a-d, monkey F). To vary the difficulty 
of the discrimination, we changed the strength of the motion and colour 
signals randomly from trial to trial (Fig. 1b). In the motion context, the 
choices of the monkeys depended strongly on the direction of motion 
of the dots (Fig. 1c), whereas the choices depended only weakly on colour 
in the same trials (Fig. 1d). The opposite pattern was evident in the 
colour context: the now relevant colour evidence exerted a large effect 
on choices (Fig. 1f) whereas motion had only a weak effect (Fig. le). 
As is common in PFC'*!>', the recorded responses of single neurons 
appeared to represent several different task-related signals at once, includ- 
ing the monkey’s upcoming choice, the context, and the strength of 
motion and colour evidence (Extended Data Figs 1 and 3). Rather than 
attempting to understand the neural mechanism underlying selective 
integration by studying the responses of single PFC neurons, we focussed 
on analysing the responses of the population as a whole. To construct 
population responses, we pooled data from both single and multi-unit 
recordings, which yielded equivalent results. The great majority of 
units were not recorded simultaneously, but rather in separate sessions. 
Units at all recording locations seemed to contribute to the task-related 
signals analysed below (Extended Data Fig. 1) and were thus combined. 


1Howard Hughes Medical Institute and Department of Neurobiology, Stanford University, Stanford, California 94305, USA. *Department of Electrical Engineering and Neurosciences Program, Stanford 
University, Stanford, California 94305, USA. 3Departments of Neurobiology and Bioengineering, Stanford University, Stanford, California 94305, USA. +Present address: Institute of Neuroinformatics, 


University of Zurich/ETH Zurich, CH-8057 Zurich, Switzerland. 
*These authors contributed equally to this work. 
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Figure 1 | Behavioural task and psychophysical performance. a, Task 
structure. Monkeys were instructed by a contextual cue to either discriminate 
the motion or the colour of a random-dot stimulus, and indicate their choice 
with a saccade to one of two targets. Depending on context, monkeys were 
rewarded for choosing the target matching the prevalent direction of motion 
(motion context) or the prevalent colour (colour context) of the random dots. 
Context was indicated by the shape and colour of the fixation point; offset of 
the fixation point was the ‘go cue’, signalling the monkey to indicate its choice 
via the operant saccade. b, Stimulus set. The motion and colour coherence 

of the dots was chosen randomly on each trial. We slightly varied the coherence 
values on each day, to equate performance across contexts and sessions 
(numbers in parentheses: average coherences (%) across sessions for monkey 
A). e-f, Psychophysical performance for monkey A in the motion (top) and 
colour contexts (bottom), averaged over 80 recording sessions (163,187 trials). 
Performance is shown as a function of motion (left) or colour (right) coherence 
in each behavioural context. The curves are fits of a behavioural model. 


Overall, we analysed 388 single-unit and 1,014 multi-unit responses 
from the two monkeys. 


State space analysis 


To study how the PFC population as a whole dynamically encodes the 
task variables underlying the monkeys’ behaviour, we represent popu- 
lation responses as trajectories in neural state space’”’”*°. Each point 
in state space corresponds to a unique pattern of neural activations 
across the population. Because activations are dynamic, changing over 
time, the resulting population responses form trajectories in state space. 

We focussed our analyses on responses in a specific low-dimensional 
subspace that captures across-trial variance due to the choice of the 
monkey (choice 1 or 2), the strength and direction of the motion 
evidence, the strength and direction of the colour evidence, and context 
(motion or colour). We estimated this task-related subspace in two 
steps (Supplementary Information). First, we used principal component 
analysis (PCA) to obtain an unbiased estimate of the most prominent 
features (that is, patterns of activations) in the population response. To 
‘de-noise’ the population responses, we restricted subsequent analyses 
to the subspace spanned by the first 12 principal components. Second, 
we used linear regression to define the four orthogonal, task-related axes 
of choice, motion, colour and context. The projection of the population 
response onto these axes yields de-mixed estimates of the correspond- 
ing task variables, which are mixed both at the level of single neurons 
(Extended Data Fig. 3) and at the level of individual principal compo- 
nents (Extended Data Fig. 4c, g; see also ref. 26). 

This population analysis yields highly reliable average response 
trajectories (Fig. 2 and Extended Data Fig. 4q, r) that capture both 
the temporal dynamics and the relationships among the task variables 
represented in PFC. In particular, four properties of the population 
responses provide fundamental constraints on the mechanisms of 
selection and integration underlying behaviour in our task. 

First, integration of evidence during presentation of the random 
dots corresponds to a gradual movement of the population response 
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in state space along the axis of choice (Fig. 2a, f). In both contexts, the 
trajectories start from a point in state space close to the centre of the 
plots (‘dots on’, purple point), which corresponds to the pattern of 
population responses at baseline. During the dots presentation the 
responses then quickly move away from this baseline level, along the 
axis of choice (red line; Fig. 2a, f). Overall, the population response 
moves in opposite directions on trials corresponding to the two differ- 
ent saccade directions (Fig. 2, choice 1 versus choice 2). The projection 
of the population response onto the choice axis (Extended Data Fig. 5b, f) 
is largely analogous to the ‘choice-predictive’ signals that have been 
identified in past studies as approximate integration of evidence during 
direction discrimination tasks”’. 

Second, the sensory inputs into PFC produce patterns of popu- 
lation responses that are very different from those corresponding to 
either choice, meaning that these signals are separable at the level of 
the population. Indeed, the population response does not follow straight 
paths along the choice axis, but instead forms prominent arcs away 
from it (Fig. 2a, f). The magnitude of each arc along the axes of motion 
or colour reflects the strength of the corresponding sensory evidence 
(see scale), whereas its direction (up or down) reflects the sign of the 
evidence (towards choice 1 or 2, filled or empty symbols, respectively). 
Whereas the integrated evidence continues to be represented along the 
axis of choice even after the disappearance of the random dots (‘dots 
off), the signals along the axes of motion and colour are transient—the 
arcs return to points near the choice axis by the time of dots offset. 
These signals thus differ from integrated evidence both in terms of the 
corresponding patterns of activation and in their temporal profile. For 
these reasons, we interpret them as ‘momentary evidence’ from the 
motion and colour inputs in favour of the two choices. This interpreta- 
tion is also consistent with the observed population responses on error 
trials, for which the momentary evidence points towards the chosen 
target, but is weaker than on correct trials (Extended Data Fig. 5c, d; 
red curves). 

Third, context seems to have no substantial effect on the direction 
of the axes of choice, motion and colour, and only weak effects on the 
strength of the signals represented along these axes. When estimated 
separately during the motion and colour contexts, the two resulting 
sets of axes span largely overlapping subspaces (see Supplementary 
Table 1); thus, a single set of three axes (the red, black and blue axes in 
Fig. 2a—f, estimated by combining trials across contexts) is sufficient 
to capture the effects of choice, motion and colour on the population 
responses in either context. A comparison of the population responses 
across contexts (Fig. 2a-c versus d-f) reveals that a single, stable 
activity pattern is responsible for integrating the relevant evidence 
in both contexts (the choice axis), while similarly stable activity pat- 
terns represent the momentary motion and colour evidence in both 
contexts (motion and colour axes). Notably, motion and colour inputs 
result in comparable deflections along the motion and colour axes, 
respectively, whether they are relevant or not (compare Fig. 2a to d 
and f to c). 

Fourth, although the directions of the axes of choice, motion and 
colour are largely invariant with context, their location in state space is 
not. The responses during the motion and colour contexts occupy 
different parts of state space, and the corresponding trajectories are 
well separated along the axis of context (Extended Data Fig. 6a, b). 


Comparison to models of selection and integration 


These properties of the population responses, which are summarized 
schematically in Fig. 3a, can be compared to the predictions of current 
models of context-dependent selection and integration (Fig. 3b-d). 
We first focussed on three fundamentally different mechanisms of 
selection that could each explain why the motion input, for example, 
influences choices in the motion context (Fig. 3, top row) but not in 
the colour context (Fig. 3, bottom row). In the framework of our task 
the three models predict population responses that differ substantially 
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Figure 2 | Dynamics of population responses in PFC. The average 
population response for a given condition and time is represented as a point in 
state space. Responses from correct trials only are shown from 100 ms after 
dots onset (dots on, purple circle) to 100 ms after dots offset (dots off) in 50-ms 
steps, and are projected into the three-dimensional subspace capturing the 
variance due to the monkey’s choice (along the choice axis), and to the 
direction and strength of the motion (motion axis) and colour (colour axis) 
inputs. Units are arbitrary; components along the motion and colour axes are 
enhanced relative to the choice axis (see scale bars in a, f). Conditions (see 
colour bars) are defined based on context (motion context, top; colour context, 
bottom), on the location of the chosen target (choice 1 versus choice 2) and 
either on the direction and strength of the motion (grey colours) or the colour 
input (blue colours). Here, choice 1 corresponds to the target in the response 
field of the recorded neurons. The direction of the colour input does not refer to 


from each other (Fig. 3b-d), and can thus be validated or rejected by 
our PFC recordings (Fig. 3a). 

The first model (Fig. 3b) is based on two widely accepted hypo- 
theses about the mechanisms underlying selection and integration 
of evidence. First, it assumes that inputs are selected early’ *, such 
that a given input drives PFC responses when relevant (grey arrow in 
Fig. 3b, top), but is filtered out before reaching PFC when irrelevant 
(no grey arrow in Fig. 3b, bottom). Second, it assumes that the relevant 
input directly elicits a pattern of activation in PFC resembling the 
pattern corresponding to a choice (the grey arrow in Fig. 3b, top, 
points along the axis of choice), as would be expected by current 
models of integration®”. 

Both hypotheses are difficult to reconcile with the recorded PFC 
responses. Whereas the strength of each input is reduced when it is 
irrelevant compared to when it is relevant, the magnitude of the 
observed reduction seems too small to account for the behavioural 
effects. For instance, irrelevant motion of high coherence (Fig. 2d, 
black) elicits a larger deflection along the motion axis (relative to 
baseline, purple dot, Fig. 2d) than relevant motion of intermediate 
coherence (Fig. 2a, dark grey). Yet the former has almost no beha- 
vioural effect (Fig. le), whereas the latter has a large behavioural effect 
(Fig. 1c). The analogous observation holds for the colour input 
(Figs 2c, f and 1d, f), strongly suggesting that the magnitude of the 
momentary evidence alone does not determine whether the corres- 
ponding input is integrated. Furthermore, the actual momentary 
motion input is represented along a direction that has little overlap 
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the colour of the dots per se (red or green), but to whether the colour points 
towards choice 1 or choice 2 (see Supplementary Information, section 6.4, for a 
detailed description of the conditions). a, Effect of choice and the relevant 
motion input in the motion context, projected onto the axes of choice and 
motion. b, Same data as in a, but rotated by 90° around the axis of choice to 
reveal the projection onto the axis of colour. c, Same trials as in b, but re-sorted 
according to the direction and strength of the irrelevant colour input. 

d-f, Responses in the colour context, analogous to a—c. Responses are averaged 
to show the effects of the relevant colour (e, f) or the irrelevant motion input 
(d). For relevant inputs (a, b and e, f), correct choices occur only when the 
sensory stimulus points towards the chosen target (3 conditions per chosen 
target); for irrelevant inputs (c, d), however, the stimulus can point either 
towards or away from the chosen target on correct trials (6 conditions per 
chosen target). 


with the choice axis, resulting in curved trajectories (Fig. 3a) that 
differ markedly from the straight trajectories predicted by the early 
selection model (Fig. 3b). 

The observed PFC responses also rule out two additional models of 
selection presented in Fig. 3. In the absence of early selection, a motion 
input might be selected within PFC by modifying the angle between 
the choice and motion axes (that is, the similarity between patterns of 
neural activity representing choice and momentary motion evidence) 
across contexts. This angle could be modified either by changing the 
direction of the motion axis between contexts while keeping the choice 
axis fixed (Fig. 3c), or vice versa (Fig. 3d). In both cases, the motion 
input would elicit movement of the population along the axis of choice 
in the motion context (top row), but not in the colour context (bottom 
row), as the motion and choice axes have little or no overlap in the 
colour context. At the single neuron level, variable axes that change 
direction across contexts would be reflected as complex, nonlinear 
interactions between context and the other task variables, which have 
been proposed in some task-switching models****. However, our data 
(Figs 2 and 3a) lend little support for variable choice (Fig. 3d) or input 
(Fig. 3c) axes. More generally, the PFC data from monkey A rule out 
any model of integration for which the degree of overlap between the 
direction of the momentary evidence and the axis of choice determines 
how much the corresponding input affects behaviour. 

The representation of task variables in PFC of monkey F replicates 
all but one key feature observed in monkey A. Most importantly, 
population responses along the choice and motion axes (Extended 
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Figure 3 | Models of selective integration inconsistent with PFC responses. 
Schematic representation of population responses observed in PFC (a) and 
expected by several models of selective integration (b-d). The models differ 
from the PFC responses with respect to the relative directions and context 
dependence of the choice axis (red lines) and the inputs (thick grey arrows; only 
motion input is shown). The relevant input is integrated as movement along 
the choice axis towards one of two choices (red crosses). A motion input 
towards choice 1 ‘pushes’ the responses along the direction of the grey arrow 
(towards choice 2: opposite direction). Same conditions as in Fig. 2a (motion 
context, top) and Fig. 2d (colour context, bottom). As in Fig. 2a and d, a single 
two-dimensional subspace (which contains the choice axis and motion input) 
is used to represent responses from both contexts. a, Idealized schematic of the 
actual PFC trajectories shown in Fig. 2a, d. Both the choice axis and motion 
input are stable between contexts. The motion input pushes the population 
response away from the choice axis. b, Early selection model. When relevant 


Data Fig. 7a, d) closely match those observed in monkey A (Fig. 2a, d); 
thus, physiological data from both monkeys are consistent in rejecting 
current models of selection and integration of motion inputs (Fig. 3b-d). 
The colour signal in monkey F, however, is equivocal. On the one hand, 
the representation of the colour input closely resembles that of a choice 
(Extended Data Fig. 1g, i), as expected from the early selection model 
described above (Fig. 3b). On the other hand, the colour input is also 
weakly represented along the colour axis in both contexts (vertical 
displacement of trajectories, Extended Data Fig. 7c, f). For the colour 
input in monkey F, therefore, we cannot with confidence accept or 
reject the early selection model. Finally, as in monkey A, context is 
represented in monkey F along a separate axis of context (Extended 
Data Fig. 6c, d). 

In summary, the population responses in both monkeys are dif- 
ficult to reconcile with current models of selection and integration 
(see also Extended Data Fig. 8). Rather, the selective integration of the 
motion input in monkeys A and F, and of the colour input in monkey 
A, must rely on a mechanism for which the very same input into PFC 
leads to movement along a fixed axis of choice in one context but not 
another. 


Recurrent network model of selection and integration 


To identify such a mechanism, we trained a network of recurrently 
connected, nonlinear neurons” to solve a task analogous to the one 
solved by the monkeys (Fig. 4). Notably, we only defined ‘what’ the 
network should do, with minimal constraints on ‘how it should do 
it” **. Thus, the solution achieved by the network is not hand-built 
into the network architecture. On each trial, neurons in the network 
receive two independent sensory inputs that mimic the momentary 
evidence for motion and colour in a single random dot stimulus. The 
network also receives a contextual input that mimics the contextual 
signal provided to the monkeys, instructing the network to discrim- 
inate either the motion or the colour input. The network activity is 
read out by a single linear read-out, corresponding to a weighted sum 


(top), the motion input pushes the population response along the choice axis. 
When irrelevant (bottom), the motion input is filtered out before reaching PFC 
(no thick grey arrow) and thus exerts no effect on choice. All trajectories fall on 
top of each other in both contexts, but the rate of movement along the choice 
axis increases with motion strength only in the motion context (insets show 
enlarged trajectories distributed vertically for clarity). c, Context-dependent 
input direction. Motion input direction varies between contexts, whereas the 
choice axis is stable. Inputs are not filtered out before PFC; rather, they are 
selected on the basis of their projection onto the choice axis. d, Context- 
dependent output direction. Similar selection mechanism to c, except that the 
choice axis varies between contexts, whereas the motion input is stable. The 
effects of the motion input on PFC responses in both monkeys (schematized in 
a) and the effects of the colour input in monkey A are inconsistent with 
predictions of the three models in b-d (respectively, Fig. 2a, d; Extended Data 
Fig. 7a,d; Fig. 2f, c). 


over the responses of all neurons in the network (see Supplementary 
Information). As in PFC, the contextual input does not affect the 
strength of the sensory inputs—selection occurs within the same net- 
work that integrates evidence towards a decision. 

We trained the network** to make a binary choice on each trial—an 
output of +1 at the end of the stimulus presentation if the relevant 
evidence pointed leftward, or a —1 if it pointed rightward. After 
training, the model qualitatively reproduces the monkeys’ behaviour, 
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Motion | 
context 
Motion Choice 1 
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Figure 4| A neural network model of input selection and integration. PFC 
is modelled as a network of recurrently connected, nonlinear, rate neurons 
that receive independent motion, colour and contextual inputs. The network is 
fully recurrently connected, and each unit receives both motion and colour 
inputs as well as two inputs that indicate context. At each time step, the sensory 
inputs are drawn from two normal distributions, the means of which 
correspond to the average strengths of the motion and colour evidence on a 
given trial. The contextual inputs take one of two values (0 or 1), which instruct 
the network to discriminate either the motion or the colour input. The network 
is read out by a single linear read-out, corresponding to a weighted sum 

over the responses of all neurons (red arrows). We trained the network (with 
back-propagation**) to make a binary choice, that is, to generate an output of 
+1 at the end of the stimulus presentation if the relevant evidence pointed 
towards choice 1, or a —1 if it pointed towards choice 2. Before training, all 
synaptic strengths were randomly initialized. 
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confirming that the model solves the selection problem at the “beha- 
vioural’ level (Extended Data Fig. 2e-h). 

We first analysed model population trajectories in the subspace 
spanned by the axes of choice, motion and colour, and found that 
they reproduce the four main features of the PFC population responses 
discussed above (Fig. 5 and Extended Data Fig. 9a—g). First, integration 
of evidence corresponds to gradual movement of the population res- 
ponse along the choice axis. Second, momentary motion and colour 
evidence ‘push’ the population away from the choice axis, resulting in 
trajectories that are parametrically ordered along the motion and col- 
our axes. Third, the direction of the axes of choice, motion and colour 
are largely invariant with context, as are the strength of the motion and 
colour inputs, as these are not gated before entering the network. 
Fourth, the trajectories during motion and colour contexts are sepa- 
rated along the axis of context (Extended Data Fig. 9f, g). Model and 
physiological dynamics differ markedly in one respect—signals along 
the input axes are transient in the physiology, but not in the model, 
yielding PFC trajectories that curve back to the choice axis before the 
end of the viewing interval (compare Figs 5a, f to 2a, f). This difference 
suggests that the sensory inputs to PFC are attenuated after a decision is 
reached. Additional differences between the model and the physio- 
logical dynamics can be readily explained by previously proposed 
imperfections in the evidence integration process, such as ‘urgency’ 
signals**”’ or instability in the integrator’* (Extended Data Fig. 10). 


A novel mechanism of selective integration 

We then ‘reverse engineered’ the model® to discover its mechanism of 
selective integration. The global features of the model activity are 
easily explained by the overall arrangement of fixed points of the 
dynamics*’ (Fig. 5), which result from the synaptic connectivity 
learned during training. Fixed points (small red crosses) correspond 
to patterns of neuronal activations (that is, locations in state space) 
that are stable when the sensory inputs are turned off. First, we found 
that the model generates a multitude of fixed points, which are 
approximately arranged to form two lines along the choice axis. 
The two sets of fixed points are separated along the axis of context 
(Extended Data Fig. 9f, g) and never exist together—one exists in the 
motion context (Fig. 5a—c), the other in the colour context (Fig. 5d-f). 
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Second, the responses around each fixed point were approximately 
stable only along a single dimension pointing towards the neighbour- 
ing fixed points (red lines), whereas responses along any other dimen- 
sion rapidly collapsed back to the fixed points. Therefore, each set of 
fixed points approximates a line attractor”. Finally, two stable attrac- 
tors (large red crosses), corresponding to the two possible choices, 
delimit each line attractor. 

The integration of the relevant evidence is thus implemented in the 
model as movement along an approximate line attractor’. The model 
population response, however, does not move strictly along the line 
attractor. Like the physiological data, model trajectories move parallel 
to the line attractors (the choice axis) at a distance proportional to the 
average strength of the sensory inputs, reflecting the momentary 
sensory evidence (Fig. 5a, c, d, f). After the inputs are turned off 
(Fig. 5, purple data points), the responses rapidly relax back to the 
line attractor. 

To understand how the relevant input is selected for integration 
along a line attractor, we analysed the local dynamics of model res- 
ponses around the identified fixed points** (Fig. 6). To simplify the 
analysis, we studied how the model responds to brief pulses of motion 
or colour inputs (Fig. 6a), rather than the noisy, temporally extended 
inputs used above. Before a pulse, we initialized the state of the network 
to one of the identified fixed points (Fig. 6a, red crosses). Locally around 
a fixed point, the responses of the full, nonlinear model can then be 
approximated by a linear dynamical system (see Supplementary Infor- 
mation), the dynamics of which can be more easily understood”. 

Both the motion and colour inputs (that is, the corresponding pulses) 
have substantial projections onto the line attractor (Fig. 6a) but, cru- 
cially, the size of these projections does not predict the extent to which 
each input will be integrated. For instance, in both contexts the motion 
pulses have similar projections onto the line attractor (Fig. 6a, left 
panels), and yet they result in large movement along the attractor in 
the motion context (top) but not in the colour context (bottom). 

The selection of the inputs instead relies on context-dependent 
relaxation of the network dynamics after the end of the pulse, which 
reverses movement along the line attractor caused by the irrelevant 
pulse (Fig. 6a, top right and bottom left) and enhances the effects of 
the relevant pulse (Fig. 6a, top left and bottom right). These relaxation 


Figure 5 | Model dynamics and fixed points 
analysis. a-f, Dynamics of model population 
responses, same conventions as in Fig. 2. Responses 
are projected into the three-dimensional subspace 
spanned by the axes of choice, motion and colour 
(defined here based on the model synaptic weights, 
see Supplementary Information, section 7.6). 
Movement along the choice axis corresponds to 
integration of evidence, and the motion and colour 
inputs deflect the trajectories along the 
corresponding input axes. Fixed points of the 
dynamics (red crosses) were computed separately 
for motion (a-c) and colour contexts (d-f) in the 
absence of sensory inputs (see Supplementary 
Information, section 7.5). The fixed points are 
‘marginally stable’ (that is, one eigenvalue of the 
linearized dynamics is close to zero, whereas all 
others have strongly negative real parts; see 
Supplementary Information). The locally 
computed right zero-eigenvectors (red lines) point 
to the neighbouring fixed points, which thus 
approximate a line attractor in each context. After 
the inputs are turned off (dots off, purple data 
points and lines) the responses relax back towards 
the line attractor. Each line attractor ends in two 
‘stable’ attractors (that is, all eigenvalues have 
strongly negative real parts, large crosses) 
corresponding to model outputs of +1 and —1 
(that is, choice 1 or 2). 
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Figure 6 | Selection and integration by recurrent dynamics. a, Average 
model population response to short (1-ms) pulses of motion (left) and colour 
inputs (right) during motion (top) and colour contexts (bottom). Motion or 
colour inputs (solid lines) are initiated when the system is steady at one of the 
identified fixed points (red crosses), and subsequent relaxation back to the line 
attractor is simulated (dots: 3-ms intervals) and averaged across fixed points. 
The size of the pulses approximately corresponds to the length of the scale bars 
in Fig. 5. Selection of the relevant input results from the context-dependent 
relaxation of the recurrent dynamics after the pulse, and is well approximated 
by the linearized dynamics around the fixed points (magenta lines). Responses 


dynamics, although counterintuitive, nevertheless follow a very simple 
tule. For a given context, the relaxation always occurs on a path that is 
orthogonal to a specific direction in state space, which we call the 
‘selection vector’ (Fig. 6b). The direction of the selection vector, like 
the direction of the line attractor, is a property of the recurrent synaptic 
weights learned by the model during training (see Supplementary 
Information). Unlike the line attractor, however, the orientation of 
the selection vector changes with context—it projects strongly onto 
the relevant input, but is orthogonal to the irrelevant one (Fig. 6b). Asa 
consequence, the relaxation dynamics around the line attractor are 
context dependent. This mechanism explains how the same sensory 
input can result in movement along the line attractor in one context but 
not the other (Fig. 6b). 

The line attractor and the selection vector are sufficient to explain 
the linearized dynamics around each fixed point (see Supplementary 
Information), and approximate well the responses of the full model 
(magenta curves, Fig. 6a). Mathematically, the line attractor and the 
selection vector correspond to the right and left zero-eigenvector of 
the underlying linear system. Within a context, these locally defined 
eigenvectors point in a remarkably consistent direction across differ- 
ent fixed points—the selection vector, in particular, is always aligned 
with the relevant input and orthogonal to the irrelevant input (Fig. 6c 
and Extended Data Fig. 10q-s). As a result, the two line attractors 
(Fig. 6c) show relaxation dynamics appropriate for selecting the rel- 
evant input along their entire length. 


Discussion 


We describe a novel mechanism underlying flexible, context-dependent 
selection of sensory inputs and their integration towards a choice (see 
refs 39-41 for related concepts). This mechanism is sufficient to explain 
the selection and integration of motion inputs in both monkeys, and of 
colour inputs in monkey A, which are not filtered out by context before 
they reach PFC. 

A randomly initialized, recurrent neural network trained to solve a 
task analogous to the monkeys’ task reproduces the main features of 
the data, and analysis of the trained network elucidates the novel 
selection mechanism. Integration along line attractors, and its relation 
to the selection vector, has been described before”’. However, our model 
demonstrates how a single nonlinear model can implement flexible 
computations by reconfiguring the selection vector and the correspond- 
ing recurrent dynamics based ona contextual input. Counterintuitively, 
in the model the projection of an input onto the line attractor does 
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are projected into the two-dimensional subspace spanned by the direction 
of the pulse and the locally computed line attractor (the right zero-eigenvector 
of the linearized dynamics). b, Explanation of how the same input pulse (left) 
leads to evidence integration in one context, but is ignored in the other (right). 
Relaxation towards the line attractor (small arrows) is always orthogonal to 
the context-dependent selection vector, and reverses the effects of the 
irrelevant pulse. c, Global arrangement of the line attractor (red) and selection 
vector (green) at each fixed point. Inputs are selected by the selection vector, 
which is orthogonal to the contextually irrelevant input (note input axes, right), 
and integrated along the line attractor. 


not determine the extent to which it is integrated, a manifestation of 
‘non-normal’ dynamics*”**** (see Supplementary Information). 

Our results show that the modulation of sensory responses is not 
necessary to select among sensory inputs (see also refs 44—46). Consistent 
with this conclusion, two studies using tasks similar to ours’””, as well 
as our own recordings in the middle temporal visual area (MT) of 
monkey A (data not shown), have found no evidence for consistent 
firing rate modulations in the relevant sensory areas. The dynamical 
process outlined in this paper is fully sufficient for context-dependent 
selection in a variety of behavioural models**, but it need not be 
exclusive. Multiple selection mechanisms may exist within the brain. 

Our results indicate that computations in prefrontal cortex emerge 
from the concerted dynamics of large populations of neurons, and are 
well studied in the framework of dynamical systems!”!?****??, 
Notably, the rich dynamics of PFC responses during selection and 
integration of inputs can be characterized and understood with just 
two features ofa dynamical system—the line attractor and the selection 
vector, which are defined only at the level of the neural population. 
This parsimonious account of cortical dynamics contrasts markedly 
with the complexity of single neuron responses typically observed in 
PFC and other integrative structures, which reveal multiplexed repres- 
entation of many task-relevant and choice-related signals'*'*'°7°7°°°, 
In light of our results, these mixtures of signals can be interpreted as 
separable representations at the level of the neural population’*!””°”*. 
A fundamental function of PFC may be to generate such separable 
representations, and to flexibly link them through appropriate recur- 
rent dynamics to generate the desired behavioural outputs. 


METHODS SUMMARY 


Two adult male rhesus monkeys (14 and 12 kg) were trained on a two-alternative, 
forced-choice, visual discrimination task. While the monkeys were engaged in the 
behavioural task, we recorded single- and multiunit responses in the arcuate 
sulcus and the prearcuate gyrus, and in cortex near and lateral to the principal 
sulcus. The great majority of neurons were not recorded simultaneously, but 
rather in separate behavioural sessions. All surgical and behavioural procedures 
conformed to the guidelines established by the National Institutes of Health and 
were approved by the Institutional Animal Care and Use Committee of Stanford 
University. We pooled data from single- and multiunit recordings to construct 
population responses, and used state space analysis to study the effect of task 
conditions and time on the population responses. We developed a dimensionality 
reduction technique (‘targeted dimensionality reduction’) to identify a low- 
dimensional subspace capturing variance due to the task variables of interest. 
We compared the recorded responses to the responses of units in a nonlinear, 
recurrent neural network model. We trained the model (that is, optimized its 
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synaptic weights with a ‘back-propagation’ algorithm) to perform a task analog- 
ous to the one performed by the monkeys. We then reverse-engineered the model 
to discover its mechanism of selective integration. We identified fixed points of 
the model dynamics, linearized the dynamics around the fixed points, and used 
linear systems analysis to understand the linearized dynamics. Full methods are 
provided in the Supplementary Information. 


Online Content Any additional Methods, Extended Data display items and 
Source Data are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extended Data Figure 1 | Recording locations and task-related patterns of 
population activity in PFC. a, Recording locations (red dots) in monkey A are 
shown on anatomical magnetic resonance images in imaging planes that were 
oriented perpendicularly to the direction of electrode penetrations. Electrodes 
were lowered through a grid (1-mm spacing) positioned over the arcuate sulcus 
(AS). Recordings covered the entire depth of the AS and extended rostrally onto 
the prearcuate gyrus and cortex near and lateral to the principal sulcus (PS). 
b-e, Representation of four task variables in the population response. Each 
multi-coloured square corresponds to a recording location (red dots) in 

a. Within each square, each pixel corresponds to a unit recorded from that grid 
position, such that each square represents all the units recorded at the 
corresponding location. The colour of a pixel indicates the de-noised regression 
coefficient of choice (b), motion coherence (c), colour coherence (d) and 
context (e) for a given unit (colour bars; grey: no units). These coefficients 
describe how much the trial-by-trial firing rate of a given unit depends on the 
task variables in b-e. The position of each unit within a square is arbitrary; we 
therefore sorted them according to the amplitude of the coefficient of choice, 
which accounts for the diagonal bands of colour in b (top-left to bottom-right, 
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high to low choice coefficient). The positions of the pixels established in b are 
maintained in c-e, so that one can compare the amplitude of the coefficient for 
each task variable for every unit recorded from monkey A. Each of the four 
panels can be interpreted as the pattern of population activity elicited by the 
corresponding task variable. The four task variables elicit very distinct patterns 
of activity and are separable at the level of the population. Importantly, the 
coefficients were de-noised with principal component analysis (see 
Supplementary Information, section 6.7) and can be estimated reliably from 
noisy neural responses (Extended Data Fig. 4i-l). Differences between 
activation patterns therefore reflect differences in the properties of the 
underlying units, not noise. f-j, Recording locations and task-related patterns 
of population activity for monkey F. Same conventions as in a—e. Recordings 
(f) covered the entire depth of the AS. The patterns of population activity 
elicited by a choice (g), by the motion evidence (h) and by context (j) are 
distinct, meaning that the representations of these task variables are separable at 
the level of the population. The representations of choice (g) and colour 

(i), however, are not separable in monkey F, indicating that colour inputs are 
processed differently in the two monkeys (see main text). 
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Extended Data Figure 2 | Psychophysical performance for monkey F and 
for the model. a-d, Psychophysical performance for monkey F, for motion 
(top) and colour contexts (bottom), averaged over 60 recording sessions 
(123,550 trials). Performance is shown as a function of motion (left) or colour 
(right) coherence in each behavioural context. As in Fig. 1c-f, coherence values 
along the horizontal axis correspond to the average low, intermediate and high 
motion coherence (a, c) and colour coherence (b, d) computed over all 
behavioural trials. The curves are fits of a behavioural model (see 
Supplementary Information, section 4). e-h, ‘Psychophysical’ performance for 
the trained neural-network model (Figs 4-6) averaged over a total of 14,400 
trials (200 repetitions per condition). Choices were generated based on the 
output of the model at the end of the stimulus presentation—an output larger 
than zero corresponds to a choice to the left target (choice 1), and an output 


model (fig. 5) 
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smaller than zero corresponds to a choice to the left target (choice 2). We 
simulated model responses to inputs with motion and colour coherences of 
0.03, 0.12 and 0.50. The variability in the input (that is, the variance of the 
underlying Gaussian distribution) was chosen such that the performance of the 
model for the relevant sensory signal qualitatively matches the performance of 
the monkeys. As in Fig. 1c-f, performance is shown as a function of motion 
(left) or colour (right) coherence in the motion (top) and colour contexts 
(bottom). Curves are fits of a behavioural model (as in a-d and in Fig. 1c-f). In 
each behavioural context, the relevant sensory input affects the model’s choices 
(e, h), but the irrelevant input does not (f, g), reflecting successful context- 
dependent integration. The model output essentially corresponds to the 
bounded temporal integral of the relevant input (not shown) and is completely 
unaffected by the irrelevant input. 
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Extended Data Figure 3 | Mixed representation of task variables in PFC. 
a-d, Example responses from six well-isolated single units in monkey A. Each 
column shows average normalized responses on correct trials for one of the 
single units. Responses are aligned to the onset of the random-dot stimulus, 
averaged with a 50-ms sliding window, and sorted by one or more task-related 
variables (choice, motion coherence, colour coherence, context). The green 
lines mark time intervals with significant effects of choice (a), motion 
coherence (b), colour coherence (c), or context (d) as assessed by multi- 
variable, linear regression (regression coefficient different from zero, P < 0.05). 
Linear regression and coefficient significance are computed over all trials 
(correct and incorrect, motion and colour context; Supplementary 
Information, section 6.3). The horizontal grey line corresponds to a normalized 
response equal to zero. a, Responses sorted by choice (solid, choice 1; dashed, 
choice 2) averaged over both contexts. b, Responses during motion context, 
sorted by choice and motion coherence (black to light-grey, high to low motion 
coherence). c, Responses during colour context, sorted by choice and colour 
coherence (blue to cyan, high to low colour coherence). d, Responses sorted by 
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choice and context (black, motion context; blue, colour context). As is typical 
for PFC, the activity of the example units depends on many task variables, 
indicating that they represent mixtures of the underlying task variables. e, f, De- 
noised regression coefficients for all units in monkey A (e) and monkey F (f). 
The data in Extended Data Fig. 1 are re-plotted here to directly compare the 
effects of different task variables (choice, motion, colour, context) to each other. 
Each data point corresponds to a unit, and the position along the horizontal and 
vertical axes is the de-noised regression coefficient for the corresponding 
task variable. The horizontal and vertical lines in each panel intersect at the 
origin (0,0). Scale bars span the same range (0.1) in each panel. The different 
task variables are mixed at the level of individual units. Although units 
modulated by only one of the task variables do occur in the population, they do 
not form distinct clusters but rather are part of a continuum that typically 
includes all possible combinations of selectivities. Significant correlations 
between coefficients are shown in red (P < 0.05, Pearson’s correlation 
coefficient r). 
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Extended Data Figure 4 | Targeted dimensionality reduction of population 
responses, and reliability of task-related axes and population trajectories. 
a, Fraction of variance explained by the first 20 principal components of the 
responses in monkey A. Principal components are computed on correct trials 
only, on condition-averaged responses. Conditions are defined on the basis 
of choice, motion coherence, colour coherence and context. Each time point of 
the average response for a given condition contributes an ‘independent’ sample 
for the principal components analysis, and variance is computed over 
conditions and times. b, Fraction of variance explained by the first 12 principal 
components. The total explainable variance (100%) is computed separately at 
each time, and reflects response differences across conditions. c, The four ‘task- 
related axes’ of choice, motion, colour and context expressed as linear 
combinations of the first 12 principal components. The four axes span a 
subspace containing the task-related variance in the population response (for 
example, Fig. 2 and Extended Data Fig. 6) and are obtained by orthogonalizing 
the de-noised regression vectors for the corresponding task variables (see 
Supplementary Information, section 6.7; de-noised regression coefficients are 
shown in Extended Data Figs 1 and 3e, f). The vertical axis in ¢ corresponds to 
the projection of each axis onto a given principal component (that is, the 
contribution of that principal component to each axis). All four axes project 
onto multiple principal components and thus the corresponding task variables 
are mixed at the level of single principal components. d, Fraction of variance 
explained by the task-related axes of choice, motion, colour and context (solid 
lines), as in b. The four axes explain a larger fraction of the variance than the 
principal components at many times but, unlike the principal components, 
they do not explain the variance common to all conditions that is due to the 
passage of time (not shown). A possible concern with our analysis is that the 
time courses of variance explained in d could be misleading if the task-related 
axes, which we estimated only at a single time for each variable, are changing 
over time during the presentation of the random dots. Under this scenario, for 
example, the ‘humped’ shape of the motion input (solid black trace) might 
reflect a changing ensemble code for motion rather than actual changes in the 
strength of the motion signal in the neural population. To control for this 
possibility, we also computed time-varying ‘task-related axes’ by estimating 
the axes of motion, colour and context separately at each time throughout the 
750-ms dots presentation. The fractions of variance explained by the time- 
varying axes (dashed lines) and by the fixed axes (solid lines) have similar 
amplitudes and time courses. Thus, the effects of the corresponding task 
variables (during the presentation of the random dots) are adequately captured 
by the subspace spanned by the fixed axes (see Supplementary Information, 
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section 6.8). e-h, Same as a—d, for monkey F. As shown in Extended Data 
Figs 1g, i and 3f (top-right panel) the de-noised regression coefficients of colour 
and choice are strongly correlated. As a consequence, the axis of colour explains 
only a small fraction of the variance in the population responses (h, blue; see 
main text). i-l, Reliability of task-related axes in monkey A. To determine to 
what extent variability (that is, noise) in single unit responses affects the task- 
related axes of choice, motion, colour and context (for example, Fig. 2 and 
Extended Data Fig. 6), we estimated each axis twice from two separate sets of 
trials (trial sets 1 and 2 in i-I). For each unit, we first assigned each trial to one of 
two subsets, and estimated de-noised regression coefficients for the task 
variables separately for the two subsets. We then obtained task-related axes by 
orthogonalizing the corresponding de-noised coefficients (see Supplementary 
Information, section 6.9). Here, the orthogonalized coefficients are computed 
both with (black) and without (grey) PCA-based de-noising. The horizontal 
and vertical lines in each panel intersect at the origin (0,0). Scale bars span the 
same range (0.1) in each panel. Data points lying outside the specified 
horizontal or vertical plotting ranges are shown on the corresponding edges in 
each panel. i, Coefficients of choice. Each data point corresponds to the 
orthogonalized coefficient of choice for a given unit, computed from trials in set 
1 (horizontal axis) or in set 2 (vertical axis). j-I, Same as i for the orthogonalized 
coefficients of motion (j), colour (k) and context (I). m-p, Orthogonalized 
regression coefficients for monkey F, as in i-I. Overall, after de-noising the 
orthogonalized coefficients are highly consistent across the two sets of trials. 
Therefore, the observed differences in the activation pattern elicited by different 
task variables (Extended Data Fig. 1) are not due to the noisiness of neural 
responses, but rather reflect differences in the properties of the underlying 
units. q, r, Reliability of population trajectories. To assess the reliability of the 
trajectories in Fig. 2, we estimated the task-related axes and the resulting 
population trajectories (same conventions as Fig. 2) twice from two separate 
sets of trials (as i-], see Supplementary Information, section 6.9). As in the 
example trajectories shown in q (trial set 1) and r (trial set 2), we consistently 
obtained very similar trajectories across the two sets of trials. To quantify the 
similarity between the trajectories from the two sets, we used trajectories 
obtained from one set to predict the trajectories obtained from the other set (see 
Supplementary Information, section 6.9). On average across 20 randomly 
defined pairs of trial sets, in both monkeys the population responses from one 
set explain 94% of the total variance in the responses of the other set (95% for 
the example in q and r). These numbers provide a lower bound on the true 
reliability of trajectories in Fig. 2, which are based on twice as many trials as 
those in q and r. 
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Extended Data Figure 5 | Population responses along individual task- 
related axes. a—e, Responses for monkey A. The average population responses 
on correct trials are re-plotted from Fig. 2, together with responses on a subset 
of incorrect trials (red curves). Here the responses are represented explicitly 
as a function of time (horizontal axis) and projected separately (vertical axes) 
onto the axes of choice (b), motion (c), colour (d) and context (e). As in Fig. 2, 
correct trials are sorted on the basis of context (motion: top sub-panels; colour: 
bottom sub-panels; see key in a), on the direction of the sensory evidence 
(filled, towards choice 1; dashed, towards choice 2) and strength of the sensory 
evidence (black to light-grey, strongest to weakest motion; blue to cyan, 
strongest to weakest colour), and based on choice (thick, choice 1; thin, 
choice 2). Incorrect trials (red curves) are shown for the lowest motion 
coherence (during motion context, top left in b-e) and the lowest colour 
coherence (during colour context, bottom right in b-e). Vertical scale bars 
correspond to 1 unit of normalized response, and the horizontal lines are drawn 
at the same level in all four sub-panels within b-e. a, Key to the condition 
averages shown in each panel of b-e, as well as to the corresponding state-space 
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panels in Fig. 2. b, Projections of the population response onto the choice axis. 
Responses along the choice axis represent integration of evidence in both 
contexts. c, Projection onto the motion axis. Responses along the motion axis 
represent the momentary motion evidence during both motion (top left) and 
colour contexts (bottom left) (curves are parametrically ordered based on 
motion strength in both contexts), but not the colour evidence (right, curves are 
not ordered based on colour strength). d, Projection onto the colour axis. 
Responses along the colour axis represent the momentary colour evidence in 
the motion (top right) and colour contexts (bottom right) (ordered), but not the 
motion evidence (left, not ordered). e, Projection onto the context axis. 
Responses in the motion context (top, all curves above the horizontal line) and 
colour context (bottom, all curves below the horizontal line) are separated along 
the context axis, which maintains a representation of context. f-i, Responses 
for monkey F, same conventions as in b-e. The responses in f-i are also shown 
as trajectories in Extended Data Fig. 7g-l. The drift along the choice axis in 
Extended Data Fig. 7g-] is reflected in the overall positive slopes in f. 
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Extended Data Figure 6 | Effect of context on PFC dynamics. a, b, Responses 
from monkey A. Same conditions and conventions as in Fig. 2, but for activity 
projected into the two-dimensional subspace capturing the variance due to 
choice (along the choice axis) and context (context axis). Components along 
the choice axis are enhanced relative to the context axis (see scale bars). The 
population response contains a representation of context, which is reflected in 
the separation between trajectories in the motion and colour contexts along the 
axis of context. The contextual signal is strongest early during the dots 
presentation. a, Effects of context (motion context versus colour context), 
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Extended Data Figure 7 | Dynamics of population responses in monkey F. 
a-f, Response trajectories in the subspace spanned by the task-related axes of 
choice, motion and colour. Same conventions as in Fig. 2. Unlike in Fig. 2, here 
we subtracted the across-condition average trajectory from each individual, raw 
trajectory (see Supplementary Information, section 6.10). The raw trajectories 
are shown in g-l and the corresponding projections onto individual axes in 
Extended Data Fig. 5f-i. Three key features of the population responses are 
shared in monkey A (Fig. 2) and monkey F. First, movement along a single 
choice axis (a and f, red arrows) corresponds to integration of the relevant 
evidence in both contexts. Second, in both contexts the momentary motion 
evidence elicits responses along the axis of motion, which is substantially 
different from the axis of choice (a and d). Third, the motion evidence is 
strongly represented whether it is relevant (a) or irrelevant (d). Thus, the 
processing of motion inputs in both monkeys is inconsistent with current 
models of selection and integration (Fig. 3b-d). Unlike in monkey A, responses 
along the colour axis in monkey F (f and c) reflect the momentary colour 
evidence only weakly. The effects of colour on the trajectories in monkey F 


choice (choice 1 versus choice 2), and motion input (direction and coherence, 
grey colours). b, Same trials as in a, but averaged to show the effect of the colour 
input (blue colours). c, d, Responses from monkey F, same conventions as in 
a, b. As in Extended Data Fig. 7a-f, we subtracted the across-condition 
average trajectory from each individual, raw trajectory (see Supplementary 
Information, section 6.10). The underlying raw population responses are 
shown in Extended Data Fig. 5f-i, and confirm that the representation of 
context is stable throughout the dots presentation time (Extended Data Fig. 5i). 
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resemble the responses expected by the early selection model (Fig. 3b). g-l, Raw 
population responses. Population trajectories were computed and are 
represented as in Fig. 2. The trajectories in a-f were obtained by subtracting the 
across-condition average from each individual trajectory shown above. Overall, 
the responses have a tendency to move towards the left along the choice axis. An 
analogous, although weaker, overall drift can also be observed in monkey A, 
and contributes to the asymmetry between trajectories on choice 1 and choice 2 
trials (Fig. 2). Because choice 1 corresponds to the target in the response field of 
the recorded neurons (see Supplementary Information, section 6.2), the drift 
reflects a tendency of individual firing rates to increase throughout the stimulus 
presentation time. By the definition of choice 1 and choice 2, a similar but 
opposite drift has to occur in neurons whose response field overlaps with choice 
2 (the responses of which we did not record). In the framework of diffusion-to- 
bound models, such a drift can be interpreted as an urgency signal, which 
guarantees that the decision boundary is reached before the offset of the dots 
(refs 36, 37). 
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Extended Data Figure 8 | Simulations of models of selective integration 
inconsistent with PFC responses. We simulated population responses 
mimicking the observed PFC responses (a-c) and alternative responses 
expected based on the three models of context-dependent selection described in 
Fig. 3b-d (d-I) (see Supplementary Information, section 8). These simulations 
are based on a diffusion-to-bound model, unlike the simulations of the 
recurrent neural network models in Figs 5 and 6 and in Extended Data Figs 9 
and 10e-s. Here, single neurons represent mixtures of three time-dependent 
task variables of a diffusion-to-bound model, namely the momentary motion 
and colour evidence and the integrated relevant evidence. At the level of the 
population, these three task variables are represented along specific directions 
in state space (arrows in a, d, g, j; red, integrated evidence; black, momentary 
motion evidence; blue, momentary colour evidence). The four simulations 
differ only with respect to the direction and context dependence of the three 
task variables. We computed state space trajectories from the population 
responses using the targeted dimensionality reduction techniques discussed in 
the main text and in Supplementary Information. The resulting simulated 
population responses reproduce the schematic population responses in Fig. 3. 
a-c, Simulated population responses mimicking the observed PFC responses 
(Fig. 2). a, Response trajectories in the two-dimensional subspace capturing the 
effects of choice and motion (left) or choice and colour (right) in the motion 
(top) and colour (bottom) contexts. Same conditions and conventions as in 
Fig. 2a, c and Fig. 2d, f. The three task variables are represented along three 
orthogonal directions in state space (arrows). b, Regression coefficients of 
choice, motion and colour for all simulated units in the population. For each 


ARTICLE 


unit, coefficients were computed with linear regression on all simulated trials 
(top) or separately on trials from the motion or colour context (bottom, context 
in parentheses). Scale bars represent arbitrary units. Numbers in the inset along 
each axis represent averages of the absolute value of the corresponding 
coefficients (+s.e.m., in parentheses). Significant correlations between 
coefficients are shown in red (P < 0.05, Pearson’s correlation coefficient r. 

c, Estimated strengths of the motion (top) and colour (bottom) inputs during 
motion (black) and colour (blue) contexts. Input strength is defined as the 
average of the absolute value of the corresponding regression coefficients. 
d-f, same as a-c, for simulated population responses expected from context- 
dependent early selection (Fig. 3b). When relevant, momentary motion (top) 
and colour (bottom) evidence are represented along the same direction as 
integrated evidence (arrows in d). g-i, same as a-c, for simulated population 
responses expected from context-dependent input directions (Fig. 3c). 
Integrated evidence is represented along the same direction in both contexts 
(red arrows in g). The relevant momentary evidence (motion in the motion 
context, top; colour in the colour context, bottom) is aligned with the direction 
of integration, whereas the irrelevant momentary evidence is orthogonal to it 
(black and blue arrows in g). j-I, same as a-c, for simulated population 
responses expected from context-dependent output directions (Fig. 3d). The 
momentary motion and colour evidence are represented along the same 
directions in both contexts (black and blue arrows in j). The direction of 
integration (red arrows in j) is aligned with the motion evidence in the motion 
context (top), and with the colour evidence in the colour context (bottom). 
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Extended Data Figure 9 | Model population responses and validation of 
targeted dimensionality reduction. a—e, Model population responses along 
individual task-related axes, same conventions as in Extended Data Fig. 5. Here 
we defined the task-related axes directly based on the synaptic connectivity in 
the model (see Supplementary Information, section 7.6; and panels h-j), rather 
than using the approximate estimates based on the population response (as for 
the PFC data, for example, Fig. 2). The same axes and the resulting projections 
underlie the trajectories in Fig. 5. The model integrates the contextually relevant 
evidence almost perfectly, and the responses along the choice axis (b) closely 
match the output of an appropriately tuned diffusion-to-bound model (not 
shown). Notably, near-perfect integration is not a core feature of the proposed 
mechanism of context-dependent selection (see main text, and Extended Data 
Fig. 10). f, g, Effect of context on model dynamics, same conditions and 
conventions as in Extended Data Fig. 6. Network activity is projected onto the 
two-dimensional subspace capturing the variance due to choice (along the 
choice axis) and context (context axis). Same units on both axes (see scale bars). 
As in Fig. 5, fixed points of the dynamics (red crosses) and the associated right 
zero-eigenvectors (that is, the local direction of the line attractor, red lines) were 
computed separately for motion (top) and colour contexts (bottom) in the 
absence of sensory inputs. The line attractors computed in the two contexts, 
and the corresponding population trajectories, are separated along the context 
axis. f, Effects of context (motion context, colour context), choice (choice 1, 
choice 2) and motion input (direction and coherence, grey colours) on the 
population trajectories. g, Same trials as in f, but re-sorted and averaged to show 
the effect of the colour input (blue colours). The context axis is approximately 
orthogonal to the motion and colour inputs, and thus the effects of motion and 
colour on the population response (Fig. 5) are not revealed in the subspace 
spanned by the choice and context axes (f and g). h-j, Validation of targeted 
dimensionality reduction. To validate the dimensionality reduction approach 
used to analyse population responses in PFC (see Supplementary Information, 
sections 6.5-6.7), we estimated the regression vectors of choice, motion, colour 
and context from the simulated population responses (Fig. 5 and panels 

b-g) and compared them to the exactly known model dimensions that underlie 
the model dynamics (see definitions below). We estimated the regression 
vectors in three ways: by pooling responses from all model units and all trials (as 
in the PFC data, for example, Fig. 2 and Extended Data Fig. 6), or separately 
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from the motion- and colour-relevant trials (contexts). Orthogonalization of 
the regression vectors yields the task-related axes of the subspace of interest (for 
example, axes in Fig. 2). Most model dimensions (motion, colour and context 
inputs, and output) were defined by the corresponding synaptic weights after 
training. The line attractor, on the other hand, is the average direction of the 
right zero-eigenvector of the linearized dynamics around a fixed point, and 
was computed separately for the motion and colour contexts. h, The three 
regression vectors of motion (black arrows), plotted in the subspace spanned by 
the choice axis (that is, the regression vector of choice) and the motion axis (that 
is, the component of the regression vector of motion orthogonal to the choice 
axis). In the colour context, the motion regression vector closely approximates 
the actual motion input (black circle—the model dimension defined by 
synaptic weights). During the motion context, however, the motion regression 
vector has a strong component along the choice axis, reflecting the integration 
of motion evidence along that axis. The motion regression vector estimated 
from all trials corresponds to the average of the vectors from the two contexts; 
thus all three motion regression vectors lie in the same plane. i, The three 
regression vectors of colour (blue arrows) plotted in the subspace spanned by 
the choice and colour axes, analogous to h. The colour regression vector closely 
approximates the actual colour input (blue circle) in the motion context, 

but has a strong component along the choice axis in the colour context. 
Components along the motion (h) and colour (i) axes are scaled by a factor of 2 
relative to those along the choice axis. j, Dot products (colour bar) between the 
regression vectors (horizontal axis) and the actual model dimensions (vertical 
axis), computed after setting all norms to 1. The choice regression vector 
closely approximates the direction of the line attractor in both contexts (squares 
labelled ‘1’). As shown also in h and i, the input regression vectors approximate 
the model inputs (defined by their synaptic weights) when the corresponding 
inputs are irrelevant (squares 2 and 4, motion and colour), whereas they 
approximate the line attractor when relevant (squares 3 and 5). Thus, the 
motion input is mostly contained in the plane spanned by the choice and 
motion axes (h), and the colour input is mostly contained in the plane spanned 
by the choice and colour axes (i). Finally, the single context regression vector is 
aligned with both context inputs (squares labelled 6), and closely approximates 
the difference between the two (not shown). 
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Extended Data Figure 10 | Urgency and instability in the integration 
process. a-d, Choice predictive neural activity (top) and psychometric curves 
(bottom) predicted by several variants of the standard diffusion-to-bound 
model (see Supplementary Information, section 7.7). a, Standard diffusion-to- 
bound model. Noisy momentary evidence is integrated over time until one of 
two bounds (+1 or —1; choice 1 or choice 2) is reached. The momentary 
evidence at each time point is drawn from a Gaussian distribution whose mean 
corresponds to the coherence of the input, and whose fixed variance is adjusted 
in each model to achieve the same overall performance (that is, similar 
psychometric curves, bottom panels). Coherences are 6%, 18% and 50% (the 
average colour coherences in monkey A, Fig. 1b). Average integrated evidence 
(neural firing rates, arbitrary units) is shown on choice 1 and choice 2 trials 
(thick versus thin) for evidence pointing towards choice 1 or choice 2 (solid 
versus dashed), on correct trials for all coherences (light grey to black, low to 
high coherence), and incorrect trials for the lowest coherence (red). The 
integrated evidence is analogous to the projection of the population response 
onto the choice axis (for example, Extended Data Fig. 5b, top left and bottom 
right). b, Urgency model. Here the choice is determined by a race between two 
diffusion processes (typically corresponding to two hemispheres), one with 
bound at +1, the other with bound at —1. The diffusion in each process is 
subject to a constant drift towards the corresponding bound, in addition to the 
drift provided by the momentary evidence. The input-independent drift 
implements an ‘urgency’ signal, which guarantees that one of the bounds is 
reached within a short time. Only the integrated evidence from one of the 
diffusion processes is shown. The three ‘choice 1’ curves are compressed (in 
contrast to a) because the urgency signal causes the bound to be reached, and 
integration towards choice 1 to cease, more quickly than in a. In contrast, 
the ‘choice 2’ curves are not compressed as the diffusion process that 
accumulates evidence towards choice 1 never approaches a bound on these 
trials. c, Same as a, but here the diffusion process is subject to a drift away from 
the starting point (0) towards the closest bound (+1 or —1). The strength of the 
drift is proportional to the distance from the starting point, and creates an 
‘instability’ at the starting point. d, Same as b, with an instability in the 
integration as in c for both diffusion processes. The asymmetry between choice 
1 and choice 2 curves in b and d resembles the asymmetry in the corresponding 
PFC curves (Extended Data Figs 5b, f, upper left). e-j, Neural network model 
with urgency. This model is based on a similar architecture as the model in 
Fig. 4. Unlike the neural network in Fig. 4, which was trained solely based on 
the model output on the last time bin of the trial, here the network is trained 
based on the output it produces throughout the entire input presentation. 
The network was trained to reproduce the integrated evidence (that is, the 
decision variable) for one of the two diffusion processes (that is, one of the 
two ‘hemispheres’) in a diffusion-to-bound model with urgency (b, see 
Supplementary Information, section 7.7). Similar conventions as in Fig. 5. The 
urgency signal is controlled by an additional binary input into the network. 


Here, the urgency and sensory inputs are turned off as soon as a bound is 
reached. The network generates only a single, stable fixed point in each context, 
corresponding to the decision boundary (large red cross). The model also 
implements a series of points of relatively slow dynamics (small red crosses) 
approximately lying on a single curve. The axes of slow dynamics at these slow 
points (red lines) are locally aligned. Notably, responses at these slow points 
have a strong tendency to drift towards the single, stable fixed point (the 
decision boundary), and thus the curve of slow points does not correspond to 
an approximate line attractor. This drift implements the urgency signal and 
causes an asymmetry in the trajectories, which converge on a single point for 
choice 1, but have endpoints that are parametrically ordered by coherence 
along the choice axis for choice 2. As discussed below (panel r), this model relies 
on the same mechanism of selection as the original model (Fig. 5, see main text). 
k-p, Neural network model with instability. Trajectories show simulated 
population responses for a model (same architecture as in Fig. 4) that was 
trained to solve the context-dependent task (Fig. 1) only on high-coherence 
stimuli and in the absence of internal noise (see Supplementary Information, 
section 7.7). Same conventions as in Fig. 5. In the absence of noise, prolonged 
integration of evidence is not necessary for accurate performance on the task. 
As a consequence, the model implements a saddle point (blue cross) instead of 
an approximate line attractor. Points of slow dynamics (small red crosses, 
obscured by the red lines) occur only close to the saddle point. The right zero- 
eigenvectors of the linearized dynamics around these slow points (red lines) 
correspond to the directions of slowest dynamics, and determine the direction 
of the axis of choice. When displaced from the saddle point, the responses 
quickly drift towards one of the two stable attractors (large red crosses) 
corresponding to the choices. For a given choice, trajectories for all coherences 
therefore end in the same location along the choice axis, in contrast to the 
responses in the original model (Fig. 5). Despite these differences, the original 
model (Fig. 5) and the network model with instability (k-p) rely on a common 
mechanism of context-dependent selection (see panel s). q-s, Dynamical 
features (key, bottom) underlying input selection and choice in three related 
neural network models. All models are based on a common architecture (Fig. 4) 
but are the result of different training procedures. q, Dynamical features of the 
model described in the main paper (Figs 5 and 6), re-plotted from Fig. 6c. r, The 
urgency model (e-j). s, The instability model (k—p). In all models, the 
developing choice is implemented as more or less gradual movement along an 
axis of slow dynamics (specified by the locally computed right eigenvectors 
associated with the near-zero eigenvalue of the linearized dynamics, red lines). 
The inputs are selected, that is, result in movement along the axis of slow 
dynamics, depending on their projection onto the selection vector (the locally 
computed left eigenvectors associated with the near-zero eigenvalue). In this 
sense, the three models implement the same mechanisms of context-dependent 
selection and choice. 
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X-ray structure of dopamine transporter 
elucidates antidepressant mechanism 


Aravind Penmatsa'*, Kevin H. Wang'* & Eric Gouaux!? 


Antidepressants targeting Na‘ /Cl- -coupled neurotransmitter uptake define a key therapeutic strategy to treat clinical 
depression and neuropathic pain. However, identifying the molecular interactions that underlie the pharmacological 
activity of these transport inhibitors, and thus the mechanism by which the inhibitors lead to increased synaptic neuro- 
transmitter levels, has proven elusive. Here we present the crystal structure of the Drosophila melanogaster dopamine 
transporter at 3.0 A resolution bound to the tricyclic antidepressant nortriptyline. The transporter is locked in an outward- 
open conformation with nortriptyline wedged between transmembrane helices 1, 3, 6 and 8, blocking the transporter from 
binding substrate and from isomerizing to an inward-facing conformation. Although the overall structure of the dopamine 
transporter is similar to that of its prokaryotic relative LeuT, there are multiple distinctions, including a kink in 
transmembrane helix 12 halfway across the membrane bilayer, a latch-like carboxy-terminal helix that caps the 
cytoplasmic gate, and a cholesterol molecule wedged within a groove formed by transmembrane helices la, 5 and 7. Taken 
together, the dopamine transporter structure reveals the molecular basis for antidepressant action on sodium-coupled 
neurotransmitter symporters and elucidates critical elements of eukaryotic transporter structure and modulation by 
lipids, thus expanding our understanding of the mechanism and regulation of neurotransmitter uptake at chemical synapses. 


Chemical neurotransmission is initiated by Ca**-induced release of 
neurotransmitters into the synaptic cleft’. Upon release into the syn- 
aptic cleft, neurotransmitters such as glutamate, dopamine, noradrena- 
line, serotonin, glycine and GABA (y-aminobutyric acid) activate 
G-protein-coupled receptors and ligand-gated ion channels, resulting 
in excitatory or inhibitory postsynaptic signalling cascades and cur- 
rents’ *. The widespread and critical roles of neurotransmitters in both 
central and peripheral nervous systems necessitate a requirement for 
strict spatiotemporal control of their levels at neural synapses. The 
primary mode of neurotransmitter clearance from the synaptic cleft 
is through secondary active transporters localized in presynaptic cells 
and glial cells that harness ionic gradients, across the cell membrane, to 
drive the uphill transport of neurotransmitters*. This symport process 
requires both Na‘ and Cl ions°, which has led to the solute carrier 6 
(SLC6) family of secondary transporters’ being referred to as neuro- 
transmitter sodium symporters (NSSs)’. 

Dysregulation of NSS function is associated with several debilitat- 
ing disorders that include depression’, attention deficit hyperactivity 
disorder®, orthostatic intolerance’, epilepsy*, Parkinson’s disease” and 
infantile parkinsonism dystonia’. NSSs are also the primary targets of 
antidepressants, drugs to treat neuropathic pain, attention deficit hyper- 
activity disorder, anxiety and of habit-forming substances of abuse 
such as cocaine and amphetamines’. Development of antidepressants 
had a serendipitous beginning in the 1950s"°, followed by the discov- 
ery that the tricyclic antidepressant (TCA) imipramine inhibits nor- 
adrenaline reuptake in tissues'’. Numerous variants of imipramine, 
and the subsequent discovery of selective serotonin reuptake inhibi- 
tors, have revolutionized antidepressant treatment'*'’. To date, 
inhibition of neurotransmitter uptake remains the most widely used 
strategy for antidepressant therapy’’, despite numerous side effects”. 

Gains in our understanding of the molecular mechanisms under- 
lying sodium-coupled transport have benefited from the structures of 


multiple conformations of LeuT'*”, a bacterial sodium-coupled amino 


acid transporter with ~20% sequence identity to the eukaryotic NSSs. 
Models of eukaryotic NSSs based on LeuT have provided valuable insights 
into substrate and ion specificities, pharmacology and transport mecha- 
nisms in NSS members'*'?. However, bacterial NSS models fall short of 
answering questions concerning the elements of NSS structure and func- 
tion, including the local structure of NSSs in regions that are unrelated 
to LeuT in amino acid sequence, the determinants of substrate select- 
ivity and the atomic-level details of transport inhibition by antidepres- 
sants and addictive compounds. Moreover, there is no understanding, 
at the level of three-dimensional structure, of the role of lipids and post- 
translational modifications in NSS structure and mechanism. 

Here we present a 3.0 A X-ray crystal structure of the Drosophila 
melanogaster dopamine transporter (DAT)” in complex with the TCA 
nortriptyline. The Drosophila DAT has greater than 50% sequence 
identity with its mammalian counterparts and harbours a pharmaco- 
logical profile that is a hybrid of the mammalian DATs, noradrenaline 
transporters (NETs) and serotonin transporters (SERTs), making it a 
powerful vehicle to study NSS pharmacology and substrate specificity”. 
The DAT structure reveals atomic details of TCA recognition, novel struc- 
tural elements of NSS protein architecture and suggests a role for choles- 
terol in the allosteric control of transport in eukaryotic NSS members. 


Thermostabilization and crystallization 

Wild-type Drosophila DAT is labile, loses ligand-binding activity upon 
detergent extraction from the cellular membranes and is refractory to 
crystallization. To stabilize DAT for functional characterization, anti- 
body generation and crystallization, we screened single point mutants 
for ligand-binding activity at increased temperatures”, ultimately combi- 
ning five mutations into the construct used for crystallization and struc- 
ture determination (DAT ..y.; Supplementary Fig. 1). Purified DAT ays 
binds to the high-affinity inhibitor nisoxetine with a dissociation constant 
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(Ka) of 29 nM (Supplementary Fig. 2a), and the TCA nortriptyline exhi- 
bits an inhibition constant (K;) of 156nM (Supplementary Fig. 2b). 
Unfortunately we were unable to measure the binding of nortriptyline 
to wild-type DAT because of its instability. Nortriptyline has a K; of 
18nM at human SERT and 4.4nM at human NET”, values that are 
~9-fold and ~35-fold lower than that for Drosophila DAT ys. In 
dopamine-uptake measurements with the wild-type DAT and amitrip- 
tyline, a precursor of nortriptyline, transport is inhibited with a K, of 
30nM”, whereas the DAT ys: construct is inactive in transport 
(Supplementary Fig. 2c, d). Crystallization was further enhanced by the 
use of a complex with a Fab, resulting in crystals of a DAT ays—-Fab 
complex that diffract X-rays to 3.0 A resolution. 


Architecture of DAT 


The structure of Drosophila DAT ys bound to nortriptyline exhibits 
an outward-open conformation whereby the antidepressant is bound 
in a cavity halfway across the membrane bilayer and accessible to solvent 
from only the extracellular side of the membrane (Fig. 1). The trans- 
porter displays an overall LeuT-like fold with 12 transmembrane helices 
(TMs) in which helices 1-5 and 6-10 are related by inherent pseudo- 
symmetry, akin to LeuT’* (Supplementary Fig. 3). Residues in TM1 
and TM6 make numerous interactions with the ligand and ions via non 
helical, hinge-like regions at the approximate mid-points of these TMs, 
connecting the bonding networks of all three ions with the inhibitor. 
Residues at the bend in TM3 contribute to the hydrophobic pocket that 
cradles the tricyclic moiety of the ligand, which lays approximately 
perpendicular to the TMs, mimicking a wedge separating the jaws of 
a vice. One cholesterol molecule is located in a groove between TM5 
and TM7 and poised to modulate the movement of TM1a that occurs 
during the transport cycle (Fig. 1a)’. 

The primary binding site accommodates nortriptyline but cannot 
adopt the subsequent helical movements of TMs 1b and 6a required to 
form the occluded state. Using LeuT for comparison, the occluded state 
of LeuT is formed in the presence of sodium and leucine substrate, but 
not in the presence of tryptophan, which binds to the primary site, 
comparable to the TCA in the context of DAT... We propose that 
both tryptophan and TCA stabilize the outward-open conformations 
of LeuT and DAT, respectively, by targeting the primary binding 
site and sterically blocking the extracellular domains of the transporter, 
preventing the extracellular gate from closing and thus acting by way 
of a foot-in-the-door mechanism (Supplementary Fig. 4a, b and Sup- 
plementary Table 2). 


Figure 1 | Architecture of Drosophila DAT ys. a, Structure of DAT. 


cryst 
viewed parallel to membrane. Nortriptyline, sodium ions, a chloride ion and a 
cholesterol molecule are shown in sphere representation in magenta, purple, 


green and yellow, respectively. b, View of DAT ys from the extracellular face. 


cryst 
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Whereas the core of DAT ays closely resembles that of LeuT, the 
periphery of DAT .,.y¢ exhibits several features distinct from LeuT and 
important for neurotransmitter transport and cellular localization. In 
TM12, a kink in the centre at Pro 572 causes the second half of the helix 
to turn away from the transporter, indicating that the dimerization inter- 
face of LeuT is not the same as potential oligomerization interfaces of 
eukaryotic NSSs (Fig. la, b and Supplementary Fig. 5). Although 
previous studies indicate that NSSs oligomerize”**, DAT .,ys¢ is mono- 
meric in detergent micelles and in the crystal lattice (Supplementary 
Fig. 6), thus suggesting that a membrane bilayer or additional mole- 
cules may be required for NSS assembly. The variable extracellular 
loop 2 (EL2) region has numerous predicted N-linked glycosylation 
sites” and one disulphide bond”®, modifications that have critical roles 
in proper trafficking of NSSs to the plasma membrane’. The strictly 
conserved disulphide linkage”® was observed in the structure between 
two conserved cysteines, Cys 148 and Cys 157 (Supplementary Fig. 7). 
In the crystal, EL2 has a central role in lattice contacts, packing against 
a neighbouring Fab with an 870 A? interface (Supplementary Fig. 6). 
Because 43 residues were deleted from EL2 in the DAT .:ys_ construct, 
further studies are required to determine the role of the full-length EL2 
in transporter structure and function (Supplementary Fig. 8a). Toge- 
ther with EL2, EL4 harboursa Zn7* -binding site in mammalian DATs 
that modulates transport’. The equivalent residues in Drosophila 
DAT ays are within a Cox—Co distance of 10 A, but because their identi- 
ties are Glu 161, Leu374 and Ala 395, they do not form a high-affinity 
Zn** -binding site in DAT cyst. 


TCA-binding site 

Unambiguous density for nortriptyline in DAT yst was observed in 
the primary site, approximately halfway across the membrane bilayer 
(Fig. 2a). In accordance with previous chimaeric studies, swapping 
TM regions between NET and DAT”, the drug-binding site is sur- 
rounded primarily by helices 1, 3, 6 and 8 in a region equivalent to the 
substrate-binding pocket of LeuT", and in close proximity to the 
densities for sodium and chloride ions (Fig. 2a). The dibenzocyclo- 
heptene ring of nortriptyline is oriented as a saddle, curving around 
the central region of TM3 and engaging in hydrophobic interactions 
with Val 120, Tyr 124 and Ala117 (Fig. 2b). Val 120 is extensively 
conserved and faces the cycloheptene ring (Supplementary Fig. 4a), 
and replacement of the corresponding Ile 172 in human SERT with larger 
substitutions such as methionine markedly reduce affinity towards most 
NSS inhibitors’’. This location was previously found in human SERT 


c 


Extracellular 
vestibule 


Nortriptyline 


c, Surface representation showing that ligand and ion binding sites are 
accessible from the extracellular vestibule. Nortriptyline and TMs 1, 3, 6 and 8 
are coloured as in a. 
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to be protected from crosslinking agents in the presence of inhibitor or 
substrate”. Phe 325 in TM6b forms an edge-to-face aromatic inter- 
action with one of the benzyl groups of nortriptyline. Residues Gly 425 
(TM8) and Ala 479 (‘TM10) also interact with the tricyclic group of the 
drug. The N-methylpropylamine group of the drug extends across the 
width of the drug-binding site and prevents TMs 1b and 6a from 
closing the extracellular gate ‘above’ the drug. The amine group forms 
a hydrogen bond with the main-chain carbonyl of Phe 43 and a cation- 
mt interaction with the side chain of Phe 43 (Fig. 2b). Interestingly, 
residues equivalent to Val120 and Phe 43 (Ile 172 and Tyr95) in 
SERT are necessary for interactions with antidepressants”. 

The biogenic amine transporters harbour a crucial aspartate residue 
in TM] and in the DAT .ys_ structure we see how Asp 46 substitutes for 
the absence of the carboxylate group in biogenic amines as compared 
to amino acid substrates transported by LeuT and the GABA and glycine 
transporters (Supplementary Fig. 8b)'°. The side chain of Asp 46 forms 
a hydrogen bond with the hydroxyl of Tyr 124, which is equivalent to 
the Tyr 108 residue in LeuT that has a role in substrate recognition 
(Fig. 2c)*’. Mutations at this aspartate result in substantial losses in 
transport activity and reduced binding affinities for cocaine*. Ser 421 
(TM8), which coordinates a sodium ion at site 2 (Na2), is within 3.5 A 
of the propylamine group of the TCA and also forms a hydrogen bond 
with the carbonyl of Phe 43. Ser 421 therefore participates in a network 
of hydrogen bonds that interconnects nortriptyline with the Na2 site 
and was also found to be crucial for high-affinity recognition of anti- 
depressants by human SERT”. 

The N-methyl group of nortriptyline is 3.1 A away from the main- 
chain carbonyl of Phe 319 and sterically prevents Phe 319 and TM6a 
from closing the extracellular gate above the drug, thereby stabilizing 
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Figure 2 | Antidepressant-binding site. 

a, Overall view of the nortriptyline-bound 

DAT ayst- Fo - Fc densities (blue mesh) for drug and 
ions are contoured at a levels of 2.5 and 3.0, 
respectively. b, Close-up view of the drug binding 
pocket. Na* and Cl” ions are shown as spheres. 
Nortriptyline is represented as sticks (magenta). 
The amino group of nortriptyline is 2.7 A from the 
carbonyl oxygen of Phe 43 (TM1a) and the 
N-methyl group of nortriptyline is 3.1 A 

from the carbonyl oxygen of Phe 319. Residues 
lining the drug-binding pocket with interfacial 
areas greater than 10 A? are represented as sticks. 
c, Comparison of the drug or substrate binding 
pocket of DAT .ys¢ with that of LeuT (PDB code 
2A65). The distance between the carboxylate 
group of leucine and Tyr 108 (spheres) is 2.7 A in 
the occluded state (2A65) and 5.1 Ain the 
inhibitor-bound state (3F3A) of LeuT, whereas the 
equivalent interaction in DAT ays¢ between Asp 46 
and Tyr 124 is 3.1 A. 


— 


the outward-open state of the transporter. Phe 319 is the equivalent of 
Phe 253 in LeuT, which gates the substrate-binding pocket (Fig. 2c)'*"”. 
The relative position of Phe 319 is markedly different from Phe 253 in 
the substrate-bound, occluded structure of LeuT, and instead resem- 
bles the positions of Phe 253 in the substrate-free and inhibitor-bound 
structures. To address the question of whether nortriptyline could bind 
to DAT ys in a LeuT-like, occluded conformation, we superimposed 
DAT ys onto the occluded state of LeuT and found that Phe 319 and 
Phe 325 would clash with the dibenzocycloheptene ring of the TCA 
(Fig. 2c and Supplementary Fig. 4c). Identification of nortriptyline 
bound in the substrate-binding pocket of DATays_ provides the first 
structural evidence that TCA antidepressants inhibit neurotransmitter 
transporters by preventing substrate binding and stabilizing the outward- 
open conformation’*’°**, The DAT .,y:-nortriptyline complex, together 
with the LeuBAT-antidepressant complexes (Wang, H. et al., unpub- 
lished observations), demonstrate conclusively that antidepressants 
inhibit NSSs by acting at the primary or S1 site, in stark contrast to 
how TCAs inhibit LeuT via a non-competitive mechanism” by bind- 
ing within the extracellular vestibule**’. 


Ion-binding sites 

Locations of ions essential for transport could be identified in DAT ayst 
with electron densities (>4.0c0) at three locations near the non-helical 
hinge-like regions of TMs 1 and 6, and close to the TCA. Densities at 
the two sites coincided exactly with Nal and Na2 sites identified in 
LeuT (Fig. 3a, b)'*. A chloride ion was positioned at the third position 
of high omit density nestled in between TMs 2, 6 and 7 and close to Nal 
(Fig. 3a). Placing ions in the omit densities during model building led to 
a concomitant loss of F,- F- density during refinement. The atomic 


Figure 3 | Ion-binding sites. a, Nal and chloride 
ion binding sites. Na* is purple and Cl is green 
and both are modelled as spheres. b, Coordination 
at the Na2 site is trigonal bipyramidal with the 
water molecule (w, red sphere) 3.3 A from the 
sodium ion. Distances are in angstroms for 
residues that are in the coordination sphere and 
interactions are shown by dashed lines. Residues 
are coloured according to their respective TMs. 
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displacement factors of the ions matched the B-values of surrounding 
atoms. The sodium at site 1 is located ~5.2A away from the amino 
group of nortriptyline and is coordinated with an octahedral geometry 
by side-chain oxygens of Asn 49, Ser 320 and Asn 352, and main-chain 
carbonyls of Ala 44 and Ser 320 (Fig. 3a and Supplementary Fig. 8b). 
Interestingly, the sodium at Na1 is also coordinated by one water mole- 
cule which in turn is within hydrogen-bonding distance to Asp 46, thus 
showing that the Asp in TM1 indirectly participates in the sodium ion 
coordination. The mean ion coordinating distances (2. 6 A) at this site 
are longer than the distances (2.42 A) reported for Na* ions in solution 
but shorter than the distances reported for K* ions (2.84 A; Supplemen- 
tary Table 3)”. 

The chloride ion is located 5.0 A away from the Nal site at a posi- 
tion previously identified by computational and mutational studies 
based on LeuT*® and the GABA transporters*’. A recent structural 
study of a chloride-dependent E290S mutant of LeuT also identified a 
chloride ion at this location*’. Chloride is coordinated in a tetrahedral 
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Cholesterol 


Figure 4 | Cholesterol site. a, Cholesterol (yellow 
sticks) shown with F, — F, density (light magenta) 
contoured at 2.00. Residues that interface with the 
cholesterol group are represented as sticks. 

b, Potential role of cholesterol in maintaining an 
outward-open state of transporter. Cholesterol 
(sticks with transparent surface) sterically clashes 
with the position of TM1a in the inward-open 
conformation of LeuT (PDB code 3TT3)’”. 


TMta (Leun 
“ed 
TM1a (DAT) 


fashion through residues in TM6 (Ser 320, Gln 316), TM7 (Ser 356) and 
TM2 (Tyr 69) (Fig. 3a). Interestingly, the hydroxyl group of Ser 320 
bridges the Nal and Cl sites and is positioned to interact with both ions. 
The mean ion-ligand distances at the Cl” site are 3.0 A (Supplementary 
Table 3) and the B-factors of surrounding atoms are similar to that of 
chloride, supporting the placement of chloride at this site. 

The sodium at the Na2 site is located ‘below’ the plane of the drug 
towards the cytoplasmic face, in between TMs 1 and 8, and is coordi- 
nated in a trigonal bipyramidal fashion by main-chain carbonyls from 
Gly 42 (TM1a), Val 45 (TM1-hinge) and Leu 417 (TM8), and the side-chain 
oxygens from Ser 421 and Asp 420 (TMB) (Fig. 3b). The mean ion-oxygen 
distances are 2.4 A, in line with reported values for sodium coordina- 
tion in solution (Supplementary Table 3). Although the interconnected 
network of interactions between TMs 1, 6, nortriptyline, sodium and 
chloride provides a structure-based mechanism for the coupling of ion 
and inhibitor binding”, we do not have a comprehensive understand- 
ing of the ion dependence of inhibitor binding in NSSs. 


Figure 5 | Extracellular and cytoplasmic gates 
and the C-terminal latch. a, Relative locations of 
the open extracellular gate (red box), closed 
cytoplasmic gate (blue box) and C-terminal latch 
(green box) in DATayst- b, The width of the 
extracellular gate is depicted by the distances 
between Tyr 124 and Phe 319 (10 A), and Arg 52 
and Asp 475 (10 A). Nortriptyline, ions and helices 
are coloured as in Fig. 1. c, The cytoplasmic gate is 
closed by polar and electrostatic interactions 
between TM1la, TM2, IL1, TM6b and TM8. d, The 
C-terminal helix following TM12 is bound to the 
cytoplasmic face of the transporter via polar 
interactions with IL1. Polar and electrostatic bonds 
are represented as grey dashed lines. 
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Cholesterol-binding site 


A cholesterol molecule is lodged in a trough-shaped cavity bordered 
by TM5, TM7 and TM 1a at a depth equivalent to the inner leaflet of the 
membrane (Fig. 4a). Branched aliphatic residues are primarily involved 
in forming the protein-cholesterol interface (359 A”), thus allowing choles- 
terol to bury ~57% of its solvent-accessible surface area. F, - F. density 
for this site clearly demarcated the orientation of the isooctyl group 
of cholesterol anchored at the junction of TMs 5 and 7 by residues 
Leu 276, Leu 277 and Ile 358. The B-face of the sterol ring primarily 
faces residues Tyr 273, Leu 270 and Trp 266 in TM5 and also interacts 
with residues Val 34, Leu 37, Leu 38 and Ile 41 on TM1la. The a-face of 
cholesterol interfaces with residues Leu347 and Ile351 in TM7 
(Fig. 4a). 

Cholesterol has an important role in modulating the function of 
NSS members**™, stabilizing an outward-open state of DAT with a 
concomitant increase in maximum binding or B,,x for cocaine®. In 
LeuT, TM1a undergoes a large conformational change upon transition 
from the outward-facing open and occluded states to the inward-open 
state’’. If a similar conformational change were to occur in DAT, it 
would entirely disrupt the cholesterol site (Fig. 4b). We propose that 
one mechanism for the action of cholesterol on DAT is that by occupy- 
ing its binding site in the outward-open, inhibitor-bound state, chole- 
sterol stabilizes the outward-open conformation of the transporter®. 


Extracellular and cytoplasmic gates 
The ion and ligand binding sites in DAT ..ys, are accessible to solvent 
from the extracellular face owing to the open gate above the primary 
binding pocket. The distance between Tyr 124 of TM3 and Phe 319 
in TMéa is 10 A, whereas in the substrate-bound, occluded state of 
LeuT the corresponding distance is half as long (Fig. 5a, b). Similarly, 
the 10 A separation between Arg 52 on TM1b and Asp 475 on TM10 
renders the primary binding site accessible to extracellular solution. 
The steric bulk of the tricyclic moiety combined with the extended 
N-methylpropylamine chain of nortriptyline prevents both TM1b and 
TMé6a from approaching TM3 and TM8 to cap the putative substrate 
pocket and close the gate. 

In contrast to the extracellular gate, extensive polar interactions at 
the intracellular face of the transporter form a thick barrier of ~24A 
between the ligand and ion pockets and solvent to keep the cytoplasmic 


Figure 6 | Mechanisms of antidepressants and cholesterol. a, The TCA 
nortriptyline (magenta) wedges between scaffold helices 3 and 8 and the core 
helices 1 and 6, preventing the movement of TMs 1b and 6a from closing the 
extracellular vestibule. b, Cholesterol (yellow) is bound in an intracellular 
pocket and hinders the movement of TM1a, thereby stabilizing the outward- 
open conformation of DAT. The C-terminal latch interacts with IL1 as part of 
the cytoplasmic gate. 


ARTICLE 


gate shut. At the cytoplasmic face of the transporter, the indole nitrogen 
of Trp 30 caps the carbonyl oxygen of Tyr 331 in TM6b, and Arg 27 forms 
a salt bridge to Asp 435 of TMB (Fig. 5c). Arg 27, Trp 30 and Asp 435 
are strictly conserved in NSS orthologues and LeuT, suggesting that 
these intracellular gate interactions are general and important facets 
of the transport mechanism for this family of sodium symporters**”’. 
Tyr 334, the residue corresponding to Tyr 335 in human DAT, was 
previously shown to be responsible for shifting the conformational 
equilibrium of the DAT towards an inward-open state*. 


C-terminal latch 


Two novel attributes at the C terminus of DAT .,y.¢ were immediately 
evident from the structure. Helix 12 is shifted by 22° in comparison to 
its position in LeuT, resulting in the exposure of TM3 to solvent and 
lipid (Supplementary Fig. 5). Pro 572, conserved in most eukaryotic 
NSS members, is probably at the root of the kink between TMs 12a 
and 12b, and thus has an important role orienting the second half of 
the helix away from the rest of the transporter. The hairpin between 
TM12b and the intracellular C terminus of DAT ays is stabilized by 
hydrogen bonding between the e-nitrogen of Arg589 and carbonyl 
oxygen of Thr 582 (Fig. 5d). The second feature is the C-terminal helix, 
which contains 2.5 turns from residues 586 to 595, where several 
hydrogen bonds and a cation-x interaction between Trp 597 and 
Arg 101 restrain this C-terminal helix near intracellular loop 1 (IL1) 
at the cytoplasmic face of DAT ..y.- Although sequence conservation 
within TM12 and the C terminus is rather low across NSS orthologues, 
Gly 584 is located at the hairpin hinge and strictly conserved, and only 
Lys or Arg is present at the position equivalent to Arg 589 of DAT ayst, 
suggesting that the conformation of the C terminus in the structure is a 
conserved feature among NSS orthologues. Studies of human DAT have 
identified the region following TM12 to contain sites for protein kinase 
C-mediated endocytic trafficking*”, and it is plausible that phosphor- 
ylation may alter the conformation or accessibility of the C terminus, 
allowing it to interact with cellular machinery for internalization. We 
also note that the latch participates in interactions with IL1, which in 
turn interacts with TMla, thus suggesting that the C-terminal latch 
may modulate transporter activity. 


Conclusion 


The structure of DAT,,.¢ captures the transporter in an inhibitor- 
bound, outward-open conformation. The TCA nortriptyline targets 
the primary substrate site and stabilizes the open conformation by 
sterically preventing closure of the extracellular gate (Fig. 6a). One 
chloride and two sodium ions are located adjacent to the ligand, 
suggesting that the binding of ions and inhibitor are directly coupled. 
A cholesterol molecule bound to a crevice flanking TM1a probably 
stabilizes the outward-open, inhibitor-bound conformation (Fig. 6b). 
The structure reveals a C-terminal latch that makes extensive inter- 
actions with the cytoplasmic face of the transporter, proximal to the 
cytoplasmic gate, and thus in a position to modulate transport activ- 
ity. Taken together, the structure of a eukaryotic DAT reveals novel 
insights into antidepressant recognition and structural elements impli- 
cated in the regulation of neurotransmitter transport, providing a foun- 
dation for drug design strategies. 


METHODS SUMMARY 


The Drosophila DAT ys construct (Supplementary Fig. 1) was expressed 
in virus-infected mammalian cells and purified by affinity and size- 
exclusion chromatography. Fab 9D5 was added before crystallization 
along with nortriptyline (1 mM) ata DAT:Fab molar ratio of 1:1.1 and 
concentrated down to 3 mg ml’. Crystals of the complex were obtained 
in the presence of 100 mM glycine, pH 9, and 38% polyethylene glycol 
(PEG) 350 monomethyl ether (MME). The structure was solved by 
molecular replacement using a polyalanine model of LeuT (PDB code 
3F3A) and an ensemble of Fab variable and constant domains. Data 
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processing, model building and refinement were performed using stan- 
dard crystallographic software (Supplementary Table 1). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Screening, construct optimization and protein expression. The Drosophila DAT 
was selected as a promising candidate for structural studies after screening multiple 
orthologues of DATs and NETs by fluorescence-detection size-exclusion chro- 
matography (FSEC)”. In addition, FSEC was used to screen other parameters such 
as detergent efficacy, thermostability, lipid effects, tertiary epitope-specific mono- 
clonal antibodies and sample homogeneity following purification. The DNA 
encoding the D. melanogaster DAT was provided by S. Amara and was modified 
by removal of the first 20 amino acids (A1-20), by a deletion in EL2 (A164-206) 
and by point mutations to enhance thermostability (V74A, V275A, V311A, L415A, 
G538L) by PCR-based methods. This modified DAT sequence, deemed DAT ast, 
was fused to a C-terminal GFP-His, tag with a thrombin cleavage site (LVPRGS) in 
place of residues 602-607. DAT <ry-GFP-—Hisg was produced by virus-mediated 
expression in mammalian cells”. 

Antibody production. Monoclonal antibodies against DAT ays were raised by D. 
Cawley using standard methods. Antibodies were screened by FSEC and western 
blot to select clones that recognized natively folded DAT .,.y_ protein. Sequencing 
of Fab regions was performed on mouse hybridoma cells (Fusion Antibodies) and 
on the intact antibody protein by Edman degradation (by M. A. Gawinowicz). 
Antibody was purified from hybridoma supernatant using 4-mercapto-ethyl- 
pyridine resin. Fab protein was generated by papain cleavage of full-length anti- 
body, followed by F, capture on Protein A resin and cation exchange. Fab was 
stored in 20 mM sodium acetate, pH 5, 250 mM NaCl and 10% glycerol. 
Purification of DAT ys. Membranes were solubilized in TBS (20mM Tris, 
pH8, 150 mM NaCl) containing n-dodecyl-f-p-maltoside (DDM) at a w/w ratio 
of 0.1 g detergent per 1 g membrane. The detergent-soluble fraction was incu- 
bated with cobalt-charged metal ion affinity resin, and DAT .,,.-GFP—Hisg was 
eluted with 100 mM imidazole in 20 mM Tris, pH 8, 300 mM NaCl, 5% glyce- 
rol, 14M lipids (1-palmitoyl-2-oleoyl-sn-glycero-3-phosphocholine (POPC), 
1-palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine (POPE) and 1-palmitoyl- 
2-oleoyl-sn-glycero-3-phospho-(1’-rac-glycerol) (POPG) at a weight ratio of 3:1:1), 
1mM DDM and 0.1 mM cholesteryl hemisuccinate . After thrombin digestion to 
remove the GFP-Hisg tag, DAT ays: was isolated by size-exclusion chromato- 
graphy in 20mM Tris, pH 8, 100mM NaCl, 5% glycerol, 14,.M POPE, 4mM 
decyl-maltoside and 0.1 mM cholesteryl hemisuccinate. The purified DAT ayst 
protein was mixed with Fab 9D5 at a molar ratio of 1:1.1 and used for crystalliza- 
tion trials at 3mg ml ' in the presence of 1 mM nortriptyline. 

Crystallization. Crystals grew in 100 mM glycine, pH 9 and 38% PEG 350 MME 
using a drop ratio of 1 pl protein and 0.5 ul reservoir solution by hanging drop 
vapour diffusion. Initial crystals appeared at 4 °C after 2 days, reaching full size 
after 7 days. Crystals were flash frozen in liquid N, directly and used for X-ray 
diffraction data collection. 

Structure determination. X-ray data were collected at the Advanced Photon Source 
(Argonne National Laboratory, beamline 24-ID-C). Data were indexed, integrated, 
and scaled using HKL2000 (ref. 54) (Supplementary Table 1). The structure was 
solved using molecular replacement, with ensembles of constant (constant domains 
of heavy and light chains as one set) and variable domains (variable domains of 
heavy and light chains as a second set) of Fab coordinates in the PDB along with a 
polyalanine model of LeuT (PDB code 3F3A). A multi-model search was done 
using Phaser”. Initial phases were improved by iterative steps of manual model 
building, refinement and maximum-likelihood density modification using Coot’, 
Phenix Refine*’ and Phenix Phase and Build”, respectively. Multiple rounds of 
refinement led to the placement of a majority of main-chain and side-chain atoms 
for both the Fab and DAT ys. The structure was refined to acceptable R-factors 
(Supplementary Table 1) with residues 20-24 and 600-605 in DAT ays and 135- 
138 in the heavy chain unmodelled owing to poor density. Nortriptyline, ions and 
cholesterol molecules were placed into F, - F, density contoured at 2o or greater in 
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the putative substrate pocket, ion sites and at the periphery of the transporter. 
Stereochemistry was evaluated using MolProbity”. 

Ligand binding and uptake measurements. Scintillation-proximity assays using 
transporter solubilized in detergent® were carried out using copper yttrium sili- 
cate (Cu-YSi) beads (Perkin Elmer) at 0.5 mg ml ~ 1 30 nM °H-labelled nisoxetine 
(1:9 *H:'H) and 10nM DAT-,ys:-GFP-Hiss protein in the same buffer as that 
used for size-exclusion chromatography. Unlabelled nortriptyline was used as the 
competitor ligand. Assay plates were read using a MicroBeta TriLux 1450 LSC & 
Luminescence counter. Data were fitted using a standard single site competition 
equation, and K; values were calculated from the ICs» values using the Cheng- 
Prusoff equation. 

Uptake assays were performed using HEK293 cells expressing respective mutant 

constructs. Cells were re-suspended in 10 1M *H-dopamine (1:49 *H:'H) contain- 
ing uptake buffer made with 25 mM HEPES-Tris, pH 7.1, 130mM NaCl, 1mM 
MgSO,, 5mM KCl, 1mM CaCl, 5mM p-glucose and 1 mM L-ascorbic acid”. 
Control samples were pre-incubated with 10 1M cold desipramine before addition 
of label. Assays were quenched with cold uptake buffer containing 1 1M desipra- 
mine after 10 min, cells were washed twice with cold uptake buffer and activity was 
measured from solubilized cells by scintillation counting. Data were plotted using 
Origin 7.0. 
Thermostability screening of DAT. Sites for mutagenesis were selected on the 
basis of a model of Drosophila DAT built on the template of LeuT and residues 
were altered to Ala, Leu or Phe’’. Individual mutants along with the wild-type 
construct were transfected into HEK293 cells and kept in culture for 48h, then 
tested for binding activity after detergent solubilization. Samples were split and 
one part was kept at 4°C, and the other portion of lysate heated at 40 °C for 
10 min. *H-nisoxetine was added before heating to select for mutants that stabil- 
ize an inhibitor-bound state of the transporter. Scintillation proximity assay was 
used to monitor activity in a high-throughput format. Mutants that consistently 
had an increased melting temperature (T,,) compared to wild-type (Tm = 35 °C) 
were chosen and pooled into one construct, which yielded a five-mutant con- 
struct with a T,, of ~60 °C. 
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The enormous size and cost of current state-of-the-art accelerators 
based on conventional radio-frequency technology has spawned 
great interest in the development of new acceleration concepts that 
are more compact and economical. Micro-fabricated dielectric laser 
accelerators (DLAs) are an attractive approach, because such dielec- 
tric microstructures can support accelerating fields one to two orders 
of magnitude higher than can radio-frequency cavity-based accel- 
erators. DLAs use commercial lasers as a power source, which are 
smaller and less expensive than the radio-frequency klystrons that 
power today’s accelerators. In addition, DLAs are fabricated via low- 
cost, lithographic techniques that can be used for mass production. 
However, despite several DLA structures having been proposed 
recently’ *, no successful demonstration of acceleration in these struc- 
tures has so far been shown. Here we report high-gradient (beyond 
250 MeV m_') acceleration of electrons in a DLA. Relativistic (60-MeV) 
electrons are energy-modulated over 563 + 104 optical periods of a 
fused silica grating structure, powered by a 800-nm-wavelength 
mode-locked Ti:sapphire laser. The observed results are in agree- 
ment with analytical models and electrodynamic simulations. By 
comparison, conventional modern linear accelerators operate at 
gradients of 10-30 MeV m“", and the first linear radio-frequency 
cavity accelerator was ten radio-frequency periods (one metre) long 
with a gradient of approximately 1.6 MeV m“' (ref. 5). Our results 
set the stage for the development of future multi-staged DLA devices 
composed of integrated on-chip systems. This would enable com- 
pact table-top accelerators on the MeV-GeV (10°-10° eV) scale for 
security scanners and medical therapy, university-scale X-ray light 
sources for biological and materials research, and portable medical 
imaging devices, and would substantially reduce the size and cost of 
a future collider on the multi-TeV (10'” eV) scale. 

In a dielectric laser accelerator, the large-amplitude fields respon- 
sible for the accelerating force are provided by a laser pulse. Of par- 
ticular interest are lasers that operate in the optical or near-infrared 
(NIR) portion of the electromagnetic spectrum to take advantage of 
the low loss and high damage threshold of dielectric materials at these 
wavelengths. Direct acceleration of electrons using a NIR laser was 
demonstrated using inverse transition radiation®. That approach, how- 
ever, requires a material boundary in the path of the electron beam and 
has a limited interaction length. To achieve scalable acceleration, the 
electric field must have a component parallel to the electron beam 
trajectory and a (speed-of-light) phase velocity that matches that of 
the (relativistic) particle beam. 

Previous efforts to accelerate particles with lasers have used a variety 
of techniques to phase-match the electron beam to a co-propagating laser 
field. The inverse Cerenkov accelerator”® uses a gas to slow the phase 
velocity of the laser light. The inverse free electron laser accelerator” 
requires an undulator at its resonance condition” to give the electron a 
synchronous velocity component. However, using a gas cell leads to 
distortions of the electron beam due to multiple scattering, and use ofa 
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permanent magnet undulator limits the compactness of such a struc- 
ture and introduces deleterious synchrotron radiation effects. 

An alternative way to satisfy the phase-velocity condition is by cre- 
ating tailored longitudinal modes in near-field structures. One such 
approach, the inverse Smith—Purcell accelerator’, has been demon- 
strated using a metallic grating’’, and more recently, in a dielectric 
grating with a NIR laser™*. In configurations that minimize the transverse 
forces, these open structures do not support speed-of-light longitudinal 
eigenmodes and are therefore useful only at sub-relativistic particle 
energies. Additionally, they produce an exponentially decaying accel- 
erating field pattern, which distorts the beam. To address these issues, 
designs using waveguides and photonic crystals have been proposed’. 
These structures present challenging fabrication tolerances, and the 
required modes are difficult to excite efficiently; as a result, no success- 
ful demonstration has been shown. To circumvent these challenges, we 
use a planar, phase-reset grating accelerator structure’, which permits a 
relatively simple fabrication process and excitation mechanism. 

The DLA is fabricated out of fused silica’* and operates at the wave- 
length (2 = 800 nm) of commercially available Ti:sapphire lasers. The 
structure consists of two opposing binary gratings of period /p, sepa- 
rated by a vacuum gap of height g, where the electron beam travels 
perpendicular to the grating rulings. A scanning electron microscope 
(SEM) image of the structure’s longitudinal cross-section is shown in 
Fig. 1a. To generate the required accelerating fields, a linearly polarized 
laser pulse is incident on the structure perpendicular to both the elec- 
tron beam direction of propagation and the plane of the gratings. The 
laser pulse generates a series of grating diffraction modes inside the 
vacuum channel, and by matching the grating period to the laser 
wavelength (Ap = 4) we achieve phase synchronicity between the 
strong first space harmonic and the particle beam”’. 

The structure essentially acts as a longitudinally periodic phase 
mask, where each grating pillar imparts a m-phase shift on the electric 
field, with respect to the adjacent vacuum space (Fig. 1b inset). As a 
result, electrons launched at the correct optical phase (laser-electron 
timing) remain phase-synchronous and experience net energy gain. In 
our experiment, the electron beam has a duration of 429 + 31 fs, much 
longer than the laser optical cycle (A/c = 2.6 fs, where c is the velocity of 
light). Electrons therefore sample all phases of the laser field, causing 
the laser-electron interaction to manifest as an energy modulation 
(broadening), with some electrons gaining energy from acceleration 
while some are decelerated. 

Figure 1b depicts a diagram of the experiment performed at the 
Next Linear Collider Test Accelerator (NLCTA) facility at the SLAC 
National Accelerator Laboratory. The incident 60-MeV electron beam 
is focused by a permanent magnetic quadrupole triplet to a root mean 
square (r.m.s.) spot size of 24 um X 8 um. The sample contains multiple 
550-1m-long (~687 optical periods) DLA structures with a period of 
800 nm and apertures of both 500 1m X 400 nm and 500 um X 800 
nm. The 400-nm gap structure provides a higher acceleration gradient 
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Figure 1 | DLA structure and experimental set-up. a, Scanning electron 
microscope image of the longitudinal cross-section of a DLA structure 
fabricated as depicted in Extended Data Fig. 1a. Scale bar, 2 um. 

b, Experimental set-up. Inset, a diagram of the DLA structure indicating the 


than an 800-nm gap structure but requires tighter tolerances on the 
electron beam. 

The NIR pulses, 1.24 + 0.12 ps long, from a regeneratively amplified 
Ti:sapphire mode-locked laser are focused to an r.m.s. spot size of 
30 um X 300 tm at the interaction point. We use a motorized four- 
axis stage for precise alignment of the structure with the electron beam. 
Once aligned, the electron beam leaving the structure goes through a 
point-to-point focusing spectrometer magnet, which disperses the 
outgoing electron beam in energy onto a Kodak Lanex phosphor 
screen that is imaged by an intensified CCD (charge-coupled device) 
camera. According to particle tracking simulations, 2.2% of the 60- 
MeV beam is transmitted through the vacuum channel of the 400-nm 
gap structure (see Methods). A segment of the spectrometer screen 
focusing on this transmitted distribution is shown in Fig. 2a. The 
horizontal axis represents beam energy, and the entire image spans 
240 keV. The central pixel location of the 60-MeV beam is taken as the 
reference point, corresponding to zero energy deviation (AE). 

The spectrometer image in Fig. 2a is a median filtered average of a 
dozen shots. The least-squares fit to the distribution of electrons scat- 
tered by the fused silica substrate and the grating teeth has been removed 
from this image to emphasize the transmitted distribution (see Methods). 
A similarly averaged set of laser-on spectrometer images within 0.5 ps 
of the optimal timing overlap for laser pulses with energy 93 J per 
pulse is shown in Fig. 2b. The white contour in both Fig. 2a and Fig. 2b 
denotes the location where the spectral charge density is 4.5% of the 
maximum density (at the peak of the scattered distribution). In the 
presence of a laser field, there exists a higher charge density on either 
side of the original peak at AE = 0. The white contour shows a sizable 
fraction of electrons with maximum energy that is ~60 keV higher 
than in the laser-off case. 

The laser-induced energy modulation is readily apparent in the energy 
spectra (Fig. 2c). Using the fits to these spectra, a maximum energy 
shift of 53.1 keV is calculated from the abscissa of the half-width at half- 
maximum (HWHM) point in the high-energy tail. We use an analy- 
tical interaction model (see Methods and Extended Data Fig. 3) to 
calculate an accelerating gradient from this measurement. Figure 2c 
shows the input electron beam distribution used in the model (blue 
curve), which is a fit to the measured spectrum in the absence of a laser 
field (light blue crosses). The calculated energy modulation (red curve) 
agrees with our measurement (pink crosses), and gives a correspond- 
ing accelerating gradient of 151.2 MeV m ' for this example. Particle 
tracking simulations (black dots; see Methods) at this gradient level give 
an independent confirmation of the observed modulated spectrum. 

To determine the maximum gradient at a given laser power level, we 
measure the energy modulation as the laser pulse is temporally scanned 
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field polarization direction and the effective periodic phase reset, depicted as 
alternating red (acceleration) and black (deceleration) arrows. A snapshot of 
the simulated fields in the structure shows the corresponding spatial 
modulation in the vacuum channel. See text for details. 


across the electron beam, forming a cross-correlation signal. A sample 
measurement at a laser pulse energy of 91.8 + 1.3 J over a laser delay 
of 6 ps is shown Fig. 3a. The orange circles (laser-off data) show no 
variation correlated with laser delay, as expected, and have an r.m.s. 
deviation of 4.5 keV, which is taken as the noise floor level of the mea- 
surement. The blue circles (laser-on data) show the expected sech’ dis- 
tribution with a full-width at half-maximum (FWHM) of 1.89 + 0.09 ps, 
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Figure 2 | Demonstration of energy modulation. a, Image of the transmitted 
electron beam on the spectrometer screen, with the laser off. b, As a but when 
the laser field is present. c, Energy spectra from a and b showing energy 
modulation. A fit (blue curve) to the measured laser-off spectrum (light blue 
crosses) is used as input for the simulations. The calculated energy modulation 
(red curve) and particle tracking simulations (black dots) agree with our 
measured spectrum (pink crosses). Images of the entire spectrometer screen are 
shown in Extended Data Fig. 2. 
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Figure 3 | Measurement of acceleration gradient in the 400-nm gap 
structure. a, Maximum energy shift as a function of laser delay gives a cross- 
correlation measurement of the laser-electron interaction. The orange filled 
circles (laser-off data) show no variation with laser delay and have an r.m.s 
deviation of 4.5 keV which is taken as the noise level (black dashed line). The 
blue filled circles (laser-on data) show the expected sech” distribution (blue 
curve). b, Calculated acceleration gradient (blue filled circles), G, as a function 


which is in agreement with the predicted value of 2.00 + 0.13 ps (see 
Methods). For each data set, the magnitude of the electron-laser inter- 
action is determined by the amplitude of the sech’ fit, which represents 
the maximum energy gain at an optimal temporal overlap. We use this 
figure to calculate the accelerating gradient. For the data set shown in 
Fig. 3a, the observed maximum energy shift is 69.2 + 2.3 keV and the 
resulting gradient is 191.1+5.7MeVm_', which falls within the 
expected range (see Supplementary Information). 

The cross-correlation measurement was repeated at multiple pulse 
energies up to 319 WJ, the maximum pulse energy available. Figure 3b 
shows the acceleration gradient G as a function of peak incident field 
Ep; along with analytical gradient calculations (see Methods) for the 
best case (green line) and worst case (red line) structure fabrication. 
We find that the measured gradient varies linearly with incident electric 
field as expected, and the measured values fall within the anticipated 
range. The highest gradient observed here is 309.8 + 20.7 MeV m '; 
this is twice the highest recorded values in radio-frequency (RF)-based 
demonstration cavities, and 6-10 times higher than typical operating 
gradients in RF structures. The deviation of the high-energy data 
points in Fig. 3b below the predicted linear trend is probably due to 
structural damage. Further optimization of the fabrication and materials 
can allow for operation at higher gradients below the laser damage 
fluence level. Additionally, despite week-long exposure of the struc- 
ture, no electron-beam-induced charging or radiation damage was 
observed in the post-examination, demonstrating the robustness of 
dielectric structures as accelerators'’. 

To further verify the linear dependence of the laser-electron inter- 
action on the electric field, the laser polarization and incidence angles 
were varied at a fixed laser power level. As shown in Extended Data 
Fig. 5, the measured gradients show good agreement with the expected 
dependencies and confirm that an intensity-dependent ponderomo- 
tive force is not responsible for the observed laser-electron interaction. 
Additionally, the fit to the data reveals that the laser—electron inter- 
action occurs over N = 563 + 104 periods of the structure. 

We have produced the first demonstration of scalable laser-driven 
acceleration with gradients exceeding 250 MeV m_' in an ultra-compact 
and inexpensive DLA. The results are in agreement with both the 
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of peak incident longitudinal electric field, Eo, showing expected linear 
dependence (dashed blue line). Gradients in excess of 250 MeV m ‘are 
observed within the simulated best-case (dashed green line) and worst-case 
(dashed red line) structure fabrication tolerance limits. Measurements on the 
800-nm gap structure are shown in Extended Data Fig. 4. Error bars in a and 
b, 68% confidence interval. 


analytical treatment and particle tracking simulations. The pulse dura- 
tion of the electron beam used in this experiment is longer than the 
optical cycle of the laser, which results in a modulation of the electron 
beam in energy. An ideal DLA-matched electron source should not 
only be sub-femtosecond in duration, but also have a nanometre-scale 
spot size, to efficiently utilize the maximum acceleration gradients in 
an optimized sub-wavelength aperture DLA accelerator. To create 
such a source, a method of bunching the electron beam at optical 
frequencies has been previously demonstrated"* and needle cathodes 
are being developed to create high-brightness low-emittance laser- 
triggered electron beam sources’. By bunching the beam, the fraction 
of captured electrons can jump from a few tens of per cent in a 5% 
energy window to >80% with an energy spread of <1% (ref. 20). 

Integrating these novel electron sources with DLA structures would 
allow phase-stable net acceleration of synchronous attosecond micro- 
bunches over many accelerating stages. Additionally, the relatively small 
operating beam charge in these microstructures can be compensated 
by the MHz repetition rate of current laser systems to achieve the beam 
luminosity required for high-energy physics research”’. Power efficiency 
estimates for such DLA devices are comparable with conventional RF 
technology, assuming that similar power efficiency (near 100%) for 
guided wave systems can be achieved, 40% laser electrical-to-optical 
conversion efficiencies (feasible with solid state thulium fibre laser 
systems)” and 40% laser to electron-beam coupling”. Continuously 
tunable radiation is a natural by-product of accelerator technology, so 
this demonstration supports the viability of DLA for the development 
of compact sources of coherent, attosecond-scale X-ray pulses” with 
applications in medical therapy, biological and materials research, and 
industrial processing. 


METHODS SUMMARY 


The DLA structure was fabricated by bonding two fused-silica wafers, which were 
processed via standard optical lithography and reactive ion etching techniques’” 
(Extended Data Fig. 1). We used a conventional linear accelerator to generate a 
relativistic electron beam, with a 10 im spot size. Owing to the dimensional mis- 
match between the electron beam and the sub-micrometre DLA aperture, many of 
the electrons experienced collisional straggling energy loss” as they traversed the 
fused-silica substrate (rather than the vacuum aperture). This process generated 
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observable optical transition radiation (OTR) as a by-product, which was subsequently 
used as a diagnostic to align the laser pulse to the electron beam, both spatially and 
temporally. The alignment of the electron beam to the aperture of the DLA struc- 
ture was optimized by maximizing the population of transmitted electrons, as 
indicated by the electron energy spectrum (Extended Data Fig. 2). Our under- 
standing of the observed spectrum was verified with G4Beamline”® simulations of 
the beam scattering and radiative losses. To quantify the maximum energy gain in 
our structure, we developed an analytical model (Extended Data Fig. 3) and per- 
formed particle tracking simulations, both of which agreed with our measure- 
ments. Damage threshold measurements” of identical DLA structures yielded a 
laser damage threshold of 0.85 + 0.14J cm”. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Grating structure fabrication. Fabrication of the dual layer grating structures was 
carried out at the Stanford Nanofabrication Facility (SNF). The gratings are pat- 
terned on fused silica wafers via optical lithography using an ASML PAS 5500 
i-line stepper and magnetically enhanced reactive ion etching on an Applied 
Materials P5000 etcher. This is shown schematically in steps 1-3 of Extended 
Data Fig. la. 

Four grating structures are included in every sample to facilitate a switch during 

the experiment, two with a 400-nm gap, and two with an 800-nm gap. Once 
patterned and etched, alignment channels of decreasing sizes (250 jum X 250 um 
— 80um X 80pm — 80m X 20m) are cut into the wafers to facilitate the 
alignment of the electron beam with the structure. Two wafers are then bonded 
directly to each other via fusion bonding and diced into individual samples (steps 
4-5 of Extended Data Fig. 1a). The alignment tolerance of the bonding process is 
~3 um so the grating-to-grating longitudinal alignment is not ensured. However, 
a perfectly aligned structure is not required to generate substantial acceleration 
gradients’*. As a final step, a section of the sample is metal-coated to serve as a 
reflector for the NIR laser, and a wedge is cut at 45° to be able to image the infrared 
laser beam and optical transition radiation (OTR) simultaneously. These two 
features, along with the alignment channels and the four grating structures, are 
shown in the schematic in Extended Data Fig. Ic. 
Electron beam. Testing of the dielectric accelerator structures is conducted using 
pre-accelerated electron bunches at the Next Linear Collider Test Accelerator 
(NLCTA) facility at SLAC National Accelerator Laboratory. The electron bunches 
are produced by an RF photoinjector and accelerated up to 60 MeV in a travelling 
wave RF cavity at a repetition rate of 10 Hz. Energy collimation is used to create a 
stable monoenergetic ‘slice’ of the electron bunch with a FWHM energy spread of 
24 keV or 0.04%, anr.m.s. length of 129 + 9 um (0.43 + 0.03 ps), and emittances of 
16mm mrad X 7 mm mrad in X and Y, respectively. Of the initial 5 pC of charge, 
10% is transmitted through the energy collimators, giving an estimated charge of 
0.5 pC incident on the grating structure. Based on particle scattering simulations, 
2.2% of these particles are transmitted through the 400-nm gap structure’s vacuum 
channel. Consequently, the population of electrons that pass through the accel- 
erating channel of the 400-nm gap device contains an estimated 11 fC of charge per 
electron bunch. A sample spectrometer image for the case of the electron beam 
travelling entirely through vacuum (that is, in the absence of the structure) is shown 
in Extended Data Fig. 2a. The horizontal axis represents beam energy and the 
resolution limit of the spectrometer is 1.2 keV per image pixel so the entire image 
spans 800 keV. The central pixel location of the narrow energy spread 60-MeV 
beam is taken as the reference point, corresponding to zero energy deviation (AE). 
Achieving electron transmission. Transmission of electrons through the sub- 
micrometre grating aperture is achieved with the help of alignment apertures of 
decreasing size that run parallel to the grating vacuum channels (see Extended 
Data Fig. 1c). Once transmission on the large channel is observed, the structure’s 
optimal pitch and yaw are determined by maximizing the transmitted distribution. 
This is repeated for the smaller apertures until we see transmission through the 
grating aperture. Once this is achieved, the laser beam incidence angle is adjusted 
to ensure perpendicularity. 

Because the NLCTA beam is significantly larger than the structure aperture, 
only a small fraction of electrons travels through the vacuum channel when the 
DLA structure is in place. Extended Data Fig. 2b shows the corresponding spectro- 
meter image for this case, containing two distinct distributions. The prominent 
broad distribution, centred at AE = —340 keV, represents the (majority) electron 
population that travels primarily through the fused-silica substrate. The energy of 
these electrons is reduced due to collisional straggling loss in the material”. The 
smaller, more compact distribution at AE = 0 keV corresponds to the population 
of electrons that traverse the grating structure primarily through the vacuum 
channel. Because the channel has a 1,280:1 aspect ratio (430 nm tall, and 550 pm 
long), these electrons are collimated with an acceptance angle of 0.04°. This dis- 
tribution therefore has a reduced vertical spread but retains its original energy 
spread. Henceforth, we refer to the higher-energy peak as the transmitted popu- 
lation and the lower-energy peak as the scattered population. 

The formation of these two distributions is confirmed by simulations of the 
scattering and radiative losses of the electron beam as it traverses the grating 
structure. Extended Data Fig. 2c shows a projection of the spectrometer image 
of Extended Data Fig. 2b onto the energy axis which gives the resulting electron 
energy spectrum. Also shown is the calculated spectrum from particle scattering 
simulations, which has peak locations and relative amplitudes in agreement with 
the observed experimental data. Because the accelerating fields are strongest inside 
the vacuum channel, we focus on the transmitted distribution when studying the 
effects of the laser—electron interaction. In our analysis, the resulting spectra are 
fitted to a curve composed of two distributions (scattered and transmitted), using 
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the method of least-squares. A sample fit, shown (in orange) in Extended Data 
Fig. 2c, matches the measured spectra closely. 

Particle scattering simulations. We model the scattering and radiative losses of the 
electron beam as it traverses the grating structure using the Geant4 code G4Beamline” 
to calculate the expected electron energy spectrum. In this model, an electron beam 
with a transverse spot size of 24 um X 8 lum and a transverse vertical emittance of 
3 mm mrad is sent through an approximate model of the grating structure with a 
400-nm gap. The model structure is made up of five layers. A central, 400-nm tall, 
vacuum layer surrounded by two 660-nm-thick grating layers in the first 550 um of 
the longitudinal dimension. Two 1.1-mm-long, 500-m-thick layers of fused silica 
surround the grating layers. Extended Data Fig. 2c shows the results of this simu- 
lation in agreement with the experimental data, giving the correct spectrum peak 
separation of 340 keV and relative peak amplitude. 

The simulations give a calculated percentage of 2.2% of the total number of 
electrons in the transmitted distribution. By comparison, simulations of the 800- 
nm gap structure yield 5.8% of the electrons in the transmitted distribution. 
Energy modulation simulations. The fields in the structure vacuum channel are 
calculated using the finite-element frequency-domain code HFSS and imported 
into a MATLAB-based code that computes the resulting relativistic Lorentz force 
on a test particle as it traverses the structure. In our simulations, we approximate 
the NLCTA electron beam using 2,500 test particles, whose position and momentum 
match those of the experiment. Owing to the long computation time required to 
perform such particle tracking simulations, we elect to model and scale a 100- 
period structure rather than model the entire 650-period structure. Based on 
convergence tests, we find that this approach produces accurate results, which is 
expected from the linearity of the acceleration effect, at a fraction of the computa- 
tion cost. The final energy distribution of the 2,500-particle ensemble is then 
calculated from the results of each individual particle trace. 

Analytical treatment of energy modulation. The interaction between a NIR laser 
pulse and a picosecond-scale electron bunch in a grating accelerator structure can 
be described analytically by partitioning the long electron bunch into a series of 
short finite slices of length t, as shown in Extended Data Fig. 3a, where each slice 
satisfies the relation t < A/c. In the presence of the sinusoidally varying electric 
fields of the structure, each slice of the electron bunch will experience a net energy 
shift with a negligible effect on the energy profile. For example, a slice of an electron 
bunch with an asymmetric energy profile typical of the NLCTA will experience a 
net energy gain of 50 keV if injected at the optimal phase in a millimetre-long 
structure capable of 50 MeV m_' gradients (see green slice in Extended Data Fig. 3b). 
The net effect of the structure is calculated by superimposing the effect for each 
individual slice. For a long pulse, we find that the predicted effect on the electron 
energy spectrum is a transition from a single peaked distribution to a double 
peaked profile of lower amplitude, as shown in Extended Data Fig. 3c. 

Energy spectrum fits. The projected energy spectra are fitted to curves containing 
the two distinct, scattered and transmitted, electron distributions. Several different 
profiles including asymmetric Gaussian, Lorenztian, sech? and Landau distribu- 
tions, along with combinations of these were initially employed. These were com- 
pared against the measured single-peak data (that is, electron beam incident on the 
fused-silica sample but away from the grating vacuum channel) to determine a 
profile that minimized the 7’ value of the fits. The scattered distribution was 
ultimately fitted to a half Lorenztian on its low-energy side, and a half sech” on 
its high-energy side. The transmitted distribution was fitted to a double-peak 
version of the scattered distribution, with the separation of the two peaks as a free 
parameter and mirrored about its centre. The sum of the two distributions fit the 
observed spectra very well, as indicated in Extended Data Fig. 2c. 

On the basis of comparisons with the particle scattering simulations and the 
analytical treatment, we find that a partially-straggled population exists, composed 
of electrons which are scattered (partially) by the grating teeth. This is accurately 
accounted for by the low-energy side Lorentzian function used to fit the transmit- 
ted distribution. To isolate the fully transmitted electrons (as shown in Fig. 2c), we 
subtract both the straggled distribution and the partially straggled distribution. 
The difference between the abscissa of the HWHM point in the high-energy tail for 
a laser-on spectrum and a laser-off spectrum gives a measurement of the laser- 
induced maximum energy shift. 

Spatial overlap. We use a miniature aluminium pellicle mirror and along working 
distance Cassegrain telescope to image the back of the grating sample with a 
resolution limit of 8 um per image pixel. To perform the spatial alignment, we 
simultaneously determine the location of the OTR origin on the sample and the 
laser beam scattering off the fabricated 45° wedge shown in Extended Data Fig. Ic. 
The laser beam is then steered until the observed scattering lines up with the OTR 
location to within ~10 um. 

Temporal overlap. A fast photodiode (25-GHz bandwidth) is used to detect the 
OTR signal the electron beam generates at an aluminium pellicle located down- 
stream of the sample. The transmitted NIR pulse is also measured simultaneously, 
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on the same detector and oscilloscope. We observe the measured arrival time of the 
two signals and adjust a four-pass optical delay line to achieve sub-nanosecond 
overlap. A motorized delay line provides gross overlap to within the 50-ps reso- 
lution limit of the oscilloscope, and a voice coil actuated retroreflector is scanned 
during the experiment to achieve picosecond-scale timing. 

Cross-correlation measurement. A typical cross-correlation signal in our experi- 
ment consist of 300 independent electron-laser interaction events measured at 150 
unique temporal overlap positions. The resulting cross-correlation signal follows a 
hyperbolic secant squared (sech”) pattern as expected based on the laser pulse 
profile indicated by autocorrelation measurements. 

Typical cross-correlation measurements co-propagate two signals through an 
interacting medium, while varying the temporal overlap. In this case, the cross- 
correlation signal is solely dependent on the temporal dimension. However, in our 
cross-correlation measurement, we use two signals incident at perpendicular angles, 
and as a result, the cross-correlation signal is a function of both the temporal and 
spatial dimensions of our electron bunch and our laser beam. The expected out- 
come of this interaction can be calculated by simulating the perpendicular crossing 
of an electron bunch with a laser beam, with the temporal and spatial dimensions 
set to match the experimental values. From this calculation, we predict a FWHM of 
2.00 + 0.13 ps for the sech? fit to the cross-correlation signal. This is in strong 
agreement with our experimental findings, where a cross-correlation FWHM 
equal to 1.89 + 0.09 ps was observed. 

Damage threshold measurement. Damage threshold measurements of DLA 
structures from the same wafer as the one used in the acceleration experiment 
were performed using an experimental set-up similar to the one described in ref. 27. 


An array of accelerator structures was placed in a vacuum chamber at a pressure of 
10° torr. The NIR pulses from our Ti:sapphire laser system was then focused to a 
r.m.s. spot size of 27 jum X 53 tm and incident on the structure, in the same 
manner as in the acceleration experiment. The laser fluence was then gradually 
increased from an initial value of 2.00 J in increments of 1.85 uJ, while the illu- 
minated site was monitored with a camera. At each energy level, approximately 
5,000 laser pulses were applied to the structure. Damage was determined to occur 
when a visible change in the strength and consistency of the scatted infrared light at 
the illumination site was observed. A total of 24 independent damage threshold 
measurements were taken, and each damage site was afterwards inspected under a 
high powered optical microscope to verify the occurrence of laser damage. 

The results of these measurements indicate a laser damage threshold of 
0.85 + 0.14J cm * for our grating accelerator structures. From post-examination 
of the structure it is clear that this damage threshold value was exceeded during the 
course of the experiment. However, we continued to see acceleration up to the 
maximum available pulse energy. On the basis of post-analysis of a laser damage 
site, we deduced that our laser r.m.s. spot size during the experiment was actually 
76 jum X 294 um, yielding a maximum fluence of 0.85J cm * on the gratings. 
Error analysis. The corresponding error bar for each data point represents a 68% 
confidence interval for the plotted fit parameter, as calculated by the Jacobian of 
the weighted nonlinear least-squares regression. The noise floor in the plots is 
defined by the laser-off data for each set of measurements. Specifically, we define 
the gradient of the laser-off data as zero gradient, and used the positive tail of the 
68% confidence interval as the noise floor. 


©2013 Macmillan Publishers Limited. All rights reserved 


a Side view Front view 


1. Pattern Cr alignment marks on fused-silica substrate. 


2. Etch trenches to a depth of .5g to define structure half-gap. 
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Extended Data Figure 1 | DLA sample preparation. a, Diagram of the 
structure fabrication process. In side view, the electron beam traverses the 
structure from left to right. In front view, the beam goes into the page. Laser is 
incident from above. See Methods for details. b, Picture of a completed wafer 
with hundreds of DLA structures. c, Diagram of a finished sample ready for 
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beam tests including four DLA structures, alignment channels, a wedge for 
spatial alignment of the laser to the electron beam, and a metal coating for 
perpendicular alignment of the laser. d, Picture of a single DLA structure on a 
fingertip. 
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Extended Data Figure 2 | Reference spectrometer screen images and 
spectrum. a, Spectrometer screen image showing the 60-MeV beam with no 
DLA structure in place. b, Spectrometer screen image of the beam after 
traversing the grating structure. c, Projection of b onto the energy axis yields the 
energy spectrum (black crosses) in agreement with particle scattering 
simulations (blue dots). The corresponding least squares spectrum fit (orange 
curve) is also shown. 
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Extended Data Figure 3 | Analytical treatment of laser-driven electron 
energy modulation. a, Picosecond-scale electron beam partitioned into a 
series of finite slices over one optical cycle of the laser (for example, the green 
slice is at optimal phase for acceleration). b, Each slice samples a different phase 
of the laser pulse and therefore experiences a corresponding net energy shift 
with a negligible effect on the energy profile (for example, the green slice 
experiences net energy gain). c, When all contributions are superimposed, the 
initial single distribution (dashed blue line) becomes a double-peaked profile 
(red line), in agreement with particle tracking simulations (black crosses). 
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Extended Data Figure 4 | Gradient measurement on 800-nm gap 
structure. Calculated gradient (blue filled circles) G as a function of 
longitudinal electric field Ey showing expected linear dependence (dashed blue 
line) and reduced strength when compared to the 400-nm gap structure, as 
expected for a larger gap. The dashed black line is the measurement noise level. 
Error bars, 68% confidence interval. 
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Extended Data Figure 5 | Experimental verification of direct laser 
acceleration. a, Gradient in the 800-nm gap structure for an input pulse energy 
of 105.2 + 3.6 J. As the laser polarization is rotated away from the direction of 
electron propagation by an angle ¢, the acceleration gradient varies as 

G x cos. Data for ¢ <0 were taken last, and beam quality had degraded. 

b, Gradient in the 400-nm gap structure at an input pulse energy of 

29.3 + 0.4 J, averaging the observed modulation over many shots taken at an 
optimal timing overlap. As the laser incidence angle deviates from 
perpendicular by an angle 0, the observed gradient decreases according to the 
expected relationship, equation (2) in Supplementary Information. Data are 
shown as blue filled circles with the corresponding least squares fit shown as 
green lines. The dashed black line is the measurement noise level. Error bars, 
68% confidence interval. 
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Emergence of macroscopic directed motion in 
populations of motile colloids 


Antoine Bricard!*, Jean-Baptiste Caussin'**, Nicolas Desreumaux', Olivier Dauchot® & Denis Bartolo’? 


From the formation of animal flocks to the emergence of coordi- 
nated motion in bacterial swarms, populations of motile organ- 
isms at all scales display coherent collective motion. This consistent 
behaviour strongly contrasts with the difference in communication 
abilities between the individuals. On the basis of this universal 
feature, it has been proposed that alignment rules at the individual 
level could solely account for the emergence of unidirectional 
motion at the group level’*. This hypothesis has been supported 
by agent-based simulations’*°. However, more complex collective 
behaviours have been systematically found in experiments, includ- 
ing the formation of vortices’’, fluctuating swarms”"°, cluster- 
ing’’” and swirling’***. All these (living and man-made) model 
systems (bacteria®’®'*, biofilaments and molecular motors”*"’, sha- 
ken grains‘*”’ and reactive colloids'*’*) predominantly rely on actual 
collisions to generate collective motion. As a result, the potential 
local alignment rules are entangled with more complex, and often 
unknown, interactions. The large-scale behaviour of the populations 
therefore strongly depends on these uncontrolled microscopic cou- 
plings, which are extremely challenging to measure and describe 
theoretically. Here we report that dilute populations of millions of 
colloidal rolling particles self-organize to achieve coherent motion in 
a unique direction, with very few density and velocity fluctuations. 
Quantitatively identifying the microscopic interactions between the 
rollers allows a theoretical description of this polar-liquid state. 
Comparison of the theory with experiment suggests that hydrodyn- 
amic interactions promote the emergence of collective motion either 
in the form of a single macroscopic ‘flock’, at low densities, or in that 
of a homogenous polar phase, at higher densities. Furthermore, 
hydrodynamics protects the polar-liquid state from the giant density 
fluctuations that were hitherto considered the hallmark of popula- 
tions of self-propelled particles**’’. Our experiments demonstrate 
that genuine physical interactions at the individual level are suf- 
ficient to set homogeneous active populations into stable directed 
motion. 

Our system consists of large populations of colloids capable of self- 
propulsion and of sensing the orientation of their neighbours solely 
by means of hydrodynamic and electrostatic mechanisms. We take 
advantage of an overlooked electrohydrodynamic phenomenon known 
as Quincke rotation’*”” (Fig. 1a). When an electric field, Eo, is applied to 
an insulating sphere immersed in a conducting fluid, above a critical 
field amplitude, Eg, the charge distribution at the sphere’s surface is 
unstable to infinitesimal fluctuations. This spontaneous symmetry 
breaking results in a net electrostatic torque, which causes the sphere 
to rotate at a constant speed around a random direction transverse to 
Ep (ref. 18). We exploit this instability to engineer self-propelled col- 
loidal rollers. We use poly(methyl methacrylate) beads of radius 
a= 2A um diluted in an hexadecane solution filling the gap between 
two conducting glass slides. Once the particles have sedimented on the 
bottom electrode, we apply a homogeneous electric field, and indeed 
observe them to start rolling at high speed (Fig. 1a). Isolated rollers, as 


we refer to these particles, travel in random directions (Fig. 1b). Their 
velocity, vo, is set by Ey and scales as [(Ep/ Eo) — 1] his (Fig. 1c and 
Supplementary Methods). 

To study the emergence of collective unidirectional motion, we 
electrically confine the roller populations in racetracks periodic in 
the curvilinear coordinate, s, that measures position along the track. 
Their width is 500 pm < W <5 mm (Fig. 2a and Methods). During a 
typical 10-min-long experiment, millions of rollers travel over dis- 
tances as large as 10°-10° particle radii, which makes it possible to 
investigate exceptionally large-scale dynamics. At low area fraction, $0, 
the rollers form an isotropic gaseous phase. They all move at the same 
velocity in random directions, as would an isolated particle (Fig. 2b and 
Supplementary Video 1). On increasing #9 above a critical value, p., we 
observe a clear transition to collective motion. A macroscopic fraction 
of the rollers self-organizes and its constituents cruise coherently in the 
same direction (Fig. 2c, d and Supplementary Videos 2-4). More 
quantitatively, we define a polarization order parameter, Ip, as the 
modulus of the time and ensemble average of the particle-velocity 
orientation. This parameter increases sharply with 9 and has a slope 
discontinuity at 6. = 3 X 10°, revealing the strongly collective nature 
of the transition (Fig. 2e). Remarkably, . is a material constant: it is 
independent of the electric field amplitude. 

For area fractions higher than but close to ¢., small density excita- 
tions nucleate from an unstable isotropic state and propagate in ran- 
dom directions. After complex collisions and coalescence events, the 
system phase separates to form a single macroscopic band that pro- 
pagates at a constant velocity, chang through an isotropic gaseous phase 
(Figs 2a, c and 3a and Supplementary Videos 2 and 3). No stationary 
state involving more than a single band was observed even in the 
largest systems (10-cm long). The velocity chang is found to be very 
close to the single-particle velocity, vo, at the front of the band. The 
bands are coupled to a net particle flux: they are colloidal flocks trav- 
elling through an isotropic phase. Their density profile is strongly 
asymmetric, unlike the slender bands observed in dense motility assays’. 
This marked asymmetry is akin to that found in one-dimensional, 
agent-based models”. It might be promoted by the high aspect ratio 
of the confinement. The local area fraction at curvilinear coordinate s 
and time t, #(s, f), increases sharply with s and then decays exponentially 
to aconstant value, #.., which is very close to the critical volume fraction, 
, (Fig. 3b). This behaviour is similar to that found in numerical simula- 
tions of the celebrated two-dimensional Vicsek model***. Remarkably, 
the bands have no intrinsic scale and their length, Li,,,a is set by particle- 
number conservation only. This result is readily inferred from Fig. 3c, 
which shows that the bands span a fraction of the racetrack that merely 
increases with #9 regardless of the overall curvilinear length, L. 

Looking now at the local polarization, we observe that the colloidal 
flock loses its internal coherence away from the band front as [/(s, t) 
decays continuously to zero along the band. Quantitatively, (s, f) and 
II(s, t) are related in a universal manner irrespective of both the particle 
velocity and the mean volume fraction (Fig. 3d). All our data collapse 
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Figure 1 | Single-roller dynamics. a, Sketch of the Quincke rotation and of the 
self-propulsion mechanisms of a colloidal roller characterized by its electric 
polarization, P, and superposition of ten successive snapshots of colloidal 
rollers. Time interval, 5.6 ms; scale bar, 50 1m. b, Probability distribution of the 
velocity vector (v||, v, ) for isolated rollers: v|| corresponds to the projection of 
the velocity on the direction tangent to the racetrack (Fig. 2); v, is normal to 
v| |. The probability distribution involves more than 1.4 x 10° measurements of 
instantaneous speed. c, Roller velocity, vo, plotted versus the field amplitude, Eo. 
Inset, vj versus Ej. The black dots represent the maximum of the probability 
distribution. Error bars, 1 s.d. 


on a single master curve solely parameterized by the particle fraction, 
bec, away from the band: /7(s,t)=1—¢,, /$(s,t). As it turns out, this 
relation corresponds to particle-number conservation in a system 
where density and polarization waves propagate steadily at a velocity 
Vo (ref. 22 and Supplementary Methods). This observation unambigu- 
ously demonstrates that the band state corresponds to a genuine sta- 
tionary flocking phase of colloidal active matter. 

On further increasing the area fraction to more than do = 2X 10 2, 
transient bands eventually catch up with themselves along the periodic 
direction and form a homogeneous polar phase (Fig. 2d and Supp- 
lementary Video 4) in which the velocity distribution condenses on a 
single orientation of motion (Fig. 4a, to be contrasted with the perfectly 


Figure 2 | Transition to directed collective motion. a, Dark-field pictures ofa 
roller population that spontaneously forms a macroscopic band propagating 
along the racetrack. Ep/Eg = 1.39, do = 10°. Scale bar, 5mm. b-d Close-up 
views. The arrows correspond to the roller displacement between two 
subsequent video frames (180 frames s_'). b, Isotropic gas. ¢) = 6 X 10-*. 
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isotropic distribution for fractions less than ¢. in Fig. 1b). Conversely, 
the roller positions are weakly correlated, as evidenced by the shape of 
the pair-distribution function, which is similar to that found in low- 
density molecular liquids (Fig. 4b). We also emphasize that the density 
fluctuations are normal at all scales (Fig. 4c). This is experimental 
observation of a polar-liquid phase of active matter. The existence of 
a polar-liquid phase was theoretically established yet had not been 
observed in any prior experiment involving active materials. Until 
now, collective motion has been found to occur in the form of patterns 
with marked density, orientational heterogeneities or both”!°'*"*"°. 
Furthermore, in contrast with the present observations, giant density 
fluctuations are considered to be a generic feature of the uniaxially 
ordered states of liquids comprising self-propelled particles**'”. We 
resolve this apparent contradiction below and quantitatively explain 
our experimental observations. 

From a theoretical perspective, the main advantage offered by the 
rollers is that their interactions are clearly identified. We show in 
Supplementary Methods how to establish the equations of motion of 
Quincke rollers interacting through electrostatic and far-field hydro- 
dynamic interactions. They take a compact form both for the position 
r; and the orientation p; of the ith particle: 


1; = Vopi 


~ 0 ak 

0; = a » 00; Fete (1; Piri) 
Here p; makes an angle 0; with the x axis, and a dot denotes a time 
derivative. In dilute systems, the particle interactions do not affect their 
propulsion speed, yet the electric field and flow field compete to align 
the p; with them. This competition results in an effective potential, Hers 
for the p;. At leading order in a/r 


Hest (1,pi.Pj) =A(1) prj + Br) pr? 
+ C(r) pie( 277 —I)-p; 
where A(r) is a positive function and thus promotes the alignment of 
the neighbouring rollers, J is the identity matrix, 77 is the outer product 
of r with itself, and a dot denotes tensor contraction. Importantly, A is 
dominated by a hydrodynamic interaction, which arises from a hydro- 


dynamic-rotlet singularity screened over distances of the order of the 
chamber height’’. The function B(r) is also short ranged and accounts 


10-1 % 


%o 102 


c, Propagating band. #) = 10 7. d, Homogeneous polar liquid. 

io = 1.8 X 10— ' Scale bar, 500 jum. e, Modulus of the average polarization, [7o, 
plotted versus the area fraction, #. Collective motion occurs as #y exceeds 
¢.=3 X10 *. 6, is independent of Eo. Error bars, 1s.d. e|| (or e, ) is the unit 
vector oriented along the tangent (or the normal) of the racetrack confinement. 
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Figure 3 | Propagating-band state. a, Spatiotemporal variations of the area 
fraction measured along the curvilinear coordinate, s, and temporal variations 
of the area fraction at s = 0.8L (white dashed line), where L is the overall length 
of the racetrack. b, Band shape plotted versus the rescaled curvilinear 
coordinate, s/L, for ¢y) = 5.3 X 10° (dark blue), 7.8 X 10° (blue), 1.0 X 10°? 
(cyan) and 1.5 X 10° (orange). c, The rescaled band length, Lyang/L, increases 
with #9 and is independent of L (white dots, L = 28 mm; grey dots, L = 50 mm; 
black dots, L = 73 mm). Error bars show the estimated error associated with the 
measurement of Lpana- d, I7(s) plotted versus 1 — ¢../¢(s). The black dots 
correspond to averages over 5,000 local measurements (grey dots). The red 
curve is the theoretical prediction. Error bars, 1 s.d. 


for a dipolar repulsion. Conversely, C(r) is long ranged and decays 
algebraically as r 7 owing to another hydrodynamic singularity 
induced by the roller motion in confinement. This singularity is 
referred to as a source doublet”. Neither B nor C yields any net align- 
ment interaction. If these two terms were neglected, our model built 
from the actual microscopic interactions would amount to the ‘flying 
xy model’ introduced on phenomenological grounds in ref. 25. We 
emphasize that Her is independent of vp and Eo, and that it is not 
specific to the Quincke mechanism. Its form could have been deduced 
from generic arguments based on global rotational invariance. 

We then use a conventional Boltzmann-like kinetic-theory frame- 
work to derive the large-scale equation of motion for the density, and 
the polarization fields**”’. In the present case, this approximation was 
fully supported by the weak positional correlations in all the three 
phases, as exemplified in Fig. 4b. The resulting hydrodynamic equations 
are shown in Supplementary Methods. At the onset of collective motion, 
the magnitudes of the terms arising from the long-range hydrodynamic 
interactions are negligible. We are therefore left with equations for ¢ and 
IT akin to those in the model of refs 2, 3. However, we explicitly provide 
the functional form of the transport coefficients introduced on phe- 
nomenological grounds in ref. 2. Accordingly, we find that the competition 
between the polar ordering (induced by the short-range hydrodynamic 
interactions) and rotational diffusion yields a mean-field phase trans- 
ition between an isotropic state and a macroscopically ordered state 
(Supplementary Methods). The phase transition occurs above a critical 
fraction, #,, that does not depend on the particle velocity (that is, on 
Eo), in agreement with our experiments: collective motion chiefly 
stems from hydrodynamic interactions between the electrically pow- 
ered rollers. However, at the onset of collective motion (that is, for 
#9 > ¢.), the homogeneous polar state is linearly unstable to spatial 
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Figure 4 | Polar-liquid state. a, Probability distribution of the velocity vector 
(v||,v_) in the polar-liquid state. The probability distribution involves more 
than 3.2 X 10” measurements of instantaneous speed. b, Pair correlation 
function, g, of the particle position in the polar-liquid state. c, The variance of 
the number of colloids, AN’, scales linearly with the average number of colloids, 
N, counted inside boxes of increasing size. Ep/Eg = 1.39, io = 9.5 X 107. 


heterogeneities. Moreover, for do > #,, the compression modes are 
unstable eigenmodes of the isotropic state, in agreement with the 
emergence of density bands observed in the experiments, all starting 
from a homogeneous state and an isotropic velocity distribution. 

We also rigorously establish a kinetic theory for the strongly polar- 
ized state reached for ¢) >, (Supplementary Methods). In this 
regime, the short-range electrostatic repulsion matters, causing the 
density fluctuations to relax and stabilizing the polar-liquid state. In 
addition, the long-range hydrodynamic interactions further stabilize 
the system by damping the modes of /7 with non-zero divergence, that 
is, the splay modes that couple orientation disturbances to density 
fluctuations. As a result, the giant density fluctuations*® are sup- 
pressed, in agreement with our unanticipated experimental findings 
(Fig. 4c and Supplementary Methods). We stress here that these long- 
range hydrodynamic interactions do not depend at all on the propul- 
sion mechanism at the individual level. They solely arise from the 
confinement of the fluid in the z direction’. They are therefore not 
specific to the Quincke propulsion mechanism. 

The only way to destroy the robust polar-liquid phase is to prevent it 
geometrically by eliminating the angular periodicity of the confine- 
ment in the curvilinear coordinate. In rectangular geometries with 
large enough aspect ratios, we observe that the bands never relax but 
rather bounce endlessly against the confining box (Supplementary 
Video 5). In confinement with an aspect ratio of order one, the band 
state is replaced by a single macroscopic vortex (Supplementary Video 6). 

We have engineered large-scale populations of self-propelled part- 
icles from which collective macroscopic polar motion emerges from 
hydrodynamic interactions at exceptionally small densities. We believe 
that control over their interactions, and the ease with which they can be 
confined in custom geometries, will extend the present framework of 
active matter to include collective motion in more complex environ- 
ments relevant to biological, robotic and social systems. 


METHODS SUMMARY 


We use commercial poly(methyl methacrylate) colloids (Thermo Scientific G0500; 
2.4-um radius), dispersed in a 0.15 moll~! AOT/hexadecane solution. The sus- 
pension is injected into a wide microfluidic chamber made of double-sided Scotch 
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Tape. The tape is sandwiched between two ITO-coated glass slides (Solems, 
ITOSOL30; 80-nm thick). To achieve electric confinement, an additional layer 
of Scotch Tape including a hole with the desired geometry is added to the upper 
ITO-coated slide. The holes are made with a precision plotting cutter (Graphtec 
Robo CE 5000). The gap between the two ITO electrodes, H = 220 jm, is constant 
over the entire chamber. The electric field is applied by means ofa voltage amplifier 
(Trek 606E-6). The colloids are observed with a Nikon AMZ1500 stereomicro- 
scope (X1 magnification) equipped with a dark-field illuminator, and with a Zeiss 
Axiovert microscope (X10 objective) for local measurements. In both case, high- 
speed videos are taken with CMOS camera (Basler Ace) at frame rates between 70 
and 900 frames per second. The particles are detected to a precision of one pixel by 
locating the intensity maxima on the experimental pictures. The particle traject- 
ories are reconstructed using a conventional tracking code”’. 
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Colloidal assembly directed by virtual magnetic moulds 


Ahmet F. Demirérs', Pramod P. Pillai’, Bartlomiej Kowalczyk! & Bartosz A. Grzybowski' 


Interest in assemblies of colloidal particles'* has long been moti- 
vated by their applications in photonics”*, electronics”*, sensors* and 
microlenses’. Existing assembly schemes’”** can position colloids of 
one type relatively flexibly into a range of desired structures, but it 
remains challenging to produce multicomponent lattices, clusters 
with precisely controlled symmetries and three-dimensional assem- 
blies’®. A few schemes can efficiently produce complex colloidal struc- 
tures”’”"*, but they require system-specific procedures. Here we show 
that magnetic field microgradients established in a paramagnetic 
fluid’®”® can serve as ‘virtual moulds’ to act as templates for the 
assembly of large numbers (~10*) of both non-magnetic and mag- 
netic colloidal particles with micrometre precision and typical yields 
of 80 to 90 per cent. We illustrate the versatility of this approach by 
producing single-component and multicomponent colloidal arrays, 
complex three-dimensional structures and a variety of colloidal mole- 
cules from polymeric particles, silica particles and live bacteria and by 
showing that all of these structures can be made permanent. In addi- 
tion, although our magnetic moulds currently resemble optical traps 
in that they are limited to the manipulation of micrometre-sized 
objects, they are massively parallel and can manipulate non-magnetic 
and magnetic objects simultaneously in two and three dimensions. 

We use magnetic fields to manipulate either paramagnetic or diama- 
gnetic colloids simultaneously. The magnetostatic potential experienced 
by a particle of radius a in an applied field H at location ris given by 


Xpart — Xsol 
2na? Lo par SO. 


|H(r))” 
Xpart ar 2Xsol +3 


U(r) 


where [lp is the magnetic permeability of free space, and par and Zco1 
are the magnetic susceptibilities of the particle and of the dispersing 
medium or solution, respectively. Although magnetic effects on diama- 
gnetic particles are typically small, this limitation can be circumvented 
by adjusting the magnetic susceptibility contrast between the particles 
and the solution, Ypart — Yso1- In particular, the magnetic susceptibilities 
of solutions of paramagnetic salts (such as Ho(NO3)3) can be tuned”? 
be 
forces F,, = —VU,, attracting paramagnetic particles to, and expelling 
diamagnetic particles from, the regions of high field can be made to 
have comparable magnitudes. We expected that this effect, combined 
with the patterning of magnetic fields on length scales commensurate 
with particle sizes, could be used to manipulate or position both para- 
magnetic and diamagnetic particles. 

The magnetic field patterns are produced by using nickel grids of the 
desired geometry and typical periodicity L ~ 1-10 um. These grids were 
embedded in a layer of poly(dimethyl siloxane) (PDMS) about 300 nm 
thick and then placed on a permanent magnet of typical strength 0.442 T. 
(For fabrication details see Supplementary Fig. 1.) The nickel/PDMS 
composite film (Supplementary Fig. 2) concentrates and modulates the 
otherwise uniform field of the magnet, with the finite-element calcula- 
tions in Fig. 1a illustrating how this modulation translates into in-plane 
forces attracting paramagnetic particles onto the nickel regions and 
diamagnetic particles onto the voids of the grid. 

We first assemble two-dimensional arrays of fluorescently labelled 
colloidal particles by flowing colloidal suspensions in an index-matched 
solution (0.2-0.4M Ho(NO3;)3; in DMSO/H,O; see Supplementary 


so that 70", < 7X01 < Means In these circumstances, the magnetostatic 


Information for full specifications) over the grid at ~10° pls *cm *. 


As expected, magnetic particles localize onto the nickel islands (Fig. 1b) 
and non-magnetic particles localize to the grid voids (Fig. 1c and Sup- 
plementary Figs 5 and 6). The assemblies are highly regular over areas 
as large as 2cm X 2cm (see Supplementary Figs 5f and 7a), corres- 
ponding to O(10°) assembled particles. Magnetic and non-magnetic 
colloids can also be addressed simultaneously to create multicompo- 
nent arrays, such as the AB lattice of similarly sized magnetic and non- 
magnetic particles assembled on a square grid shown in Fig. 1d. When 
particle sizes differ substantially, steric effects can become important, 
as in the AB, structure in Fig. le: the smaller paramagnetic particles 
only fit onto the threefold junctions of the honeycomb grid, but not 
onto its edges between proximal larger particles. 

In the binary systems discussed so far, particle separation was based 
predominantly on the differences in their magnetic susceptibilities. The 
magnetophoretic forces, however, can also discriminate between parti- 
cles having the same Ypart but different sizes (because magnetostatic 
potential scales with particle volume, Uy oc ca This makes it possible 
to assemble structures comprising two types of diamagnetic colloid— 
when such particles assemble onto the voids of the nickel grid, it is most 
energetically favourable for the large particle to occupy the void’s centre, 
leaving only the peripheral region available for the smaller particles (see 
‘halo’ structures in the Supplementary Information and Supplementary 
Fig. 6c, d). This mode of assembly can be also combined with the posi- 
tioning of paramagnetic colloids to yield ternary structures as shown in 
Fig. 1f, where the paramagnetic particles (violet) localize onto nickel 
islands, the larger diamagnetic particles (green) centre in the spaces 
between four nearby islands, and the small diamagnetic colloids (orange) 
occupy the rest of the nickel-free surface. 

When the magnet is removed, the ‘virtual magnetic mould’ organi- 
zing the particles is lost and the assemblies fall apart. But the structures 
can be made permanent by fixing them to the substrate by means of 
carbamide bonds, so they can be transferred into polymers such as 
PDMS to give diffraction elements and photolithographic masks (see 
Supplementary Information and Supplementary Figs 7-9). 

By exploiting the fact that islands larger than the assembling particles 
will harbour several particles in a configuration that depends on the 
island’s shape and relative island-to-particle dimensions, we can also use 
our Virtual moulds to drive the assembly of colloidal clusters, also known as 
colloidal ‘molecules’. The large-area confocal image in Fig. 2a shows an 
example of an array in which each ‘void’ accumulates four diamagnetic 
particles. These tetramers A, are then attached to the substrate with car- 
bamide bonds, made permanent by silica deposition (see Supplementary 
Information) and liberated into solution on sonication (Fig. 2b). We have 
similarly assembled Ag hexamers in triangular voids (Fig. 2c). The AB 
dimer in Fig. 2d illustrates the assembly of multicomponent colloidal 
molecules, which in this example arises because the egg-shaped voids 
are too small to fit two large (1.2 um) particles but are able to accommodate 
one large and one smaller (800nm) diamagnetic particle. After silica 
deposition and sonication, these dimers can again be liberated into solu- 
tion. Voids of other shapes and sizes allow us to assemble particles into AB, 
(Fig. 2e), AB; (Fig. 2f) and even AB, (Fig. 2g) clusters. 

The magnetic fields of our virtual moulds extend into the bulk fluid; 
the plot of magnetic field strength over a nickel grid in Fig. 3a shows 
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Figure 1 | Micropatterned magnetophoretic forces direct the assembly of 
single-component and multicomponent colloidal arrays. a, Scheme of 
experimental arrangement and calculated force profiles (see Supplementary Figs 3 
and 4). Lateral magnetic forces (arrows labelled F,,) position magnetic particles 
(red) onto the nickel ‘islands’ of the grid and non-magnetic particles (blue) onto the 
‘voids’ of the grid. b, Magnetic particles 2.85 jum in size assembled on a square grid 
of nickel islands. Fidelity of the assembly, defined as the percentage of lattice sites 
without defects, was f = 97 + 2%. c, Diamagnetic particles 1.2 jum in size arranged 
in the voids of a hexagonal grid of nickel rings; f= 95 + 3%. d, Paramagnetic 
particles (red; 1 jum) and diamagnetic particles (green; 1.2 ,um) organize into an AB 
lattice on the underlying square grid; f= 93 + 6%. e, Paramagnetic particles 

(red; 1.5 jum) and non-magnetic particles (green; 2.5 jum) on a hexagonal grid form 
an AB, hexagonal lattice; f= 97 + 1%. f, Tertiary assembly of 2.65-,1m magnetic 
particles (purple) together with 3.2-j1m (green) and 800-nm (red) non-magnetic 
particles on a square nickel grid. The fidelity of the purple particles’ assembly was 
f= 92 + 5%; that of green particles was f= 90 + 6% (note that the arrangement of 
the 800-nm particles is not crystalline). All fvalues reported for this and subsequent 
crystalline or cluster assemblies are based on statistics for at least 500 particles 
and, in most cases, more than 1,000 particles, counted based on at least five 
(and up to ten) different images from at least three (up to seven) different 
samples. Scale bars, 10 jum. The insets illustrate the underlying nickel grids. 
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Figure 2 | Planar colloidal molecules. a, Large-area confocal image of a 
monolayer of 1.2-|1m diamagnetic particles assembled into tetramers over 
2.5-uum round voids ofa nickel grid; f= 84 + 5%. b, Left: the same array fixed to 
the PDMS substrate by means of isocyanate-amino chemistry and with 
particles bonded to one another by silica deposition. Middle: confocal image of 
the bonded particles liberated into solution by sonication. Right: scanning 
electron microscope (SEM) image of liberated tetramers after drying. 

c, Analogous images of hexamers of 1.2-1m particles assembled on 3-,1m 
triangular voids; f= 80 + 10%. d-g, Examples of colloidal molecules 
comprising two types of diamagnetic particle (1.2 1m (green) and 800 nm 
(red)). Fidelities fare 90 + 4% in d, 70 + 6% ine, 68 + 7% in f, and 57 = 6% in 
g. Dashed lines outline the contours of the grid voids. Scale bars, 2 um. See also 
Supplementary Movies 1-4. 


that the field is effectively modulated up to a distance above the mag- 
netic islands that is commensurate with their size. We should thus be 
able to organize colloidal particles into three-dimensional structures. 
Indeed, small diamagnetic particles introduced above a grid in large 
quantities pack over the nickel-free voids of the grid and form regular 
arrays of posts (Fig. 3c). By exploiting the size dependence of magnetic 
forces acting on non-magnetic particles (see above, Fig. 1fand Supplemen- 
tary Fig. 6c, d), we can create such posts with complex internal structures. 
In the example in Fig. 3c, d, the larger particles (green) are expelled 
from the high-field regions more strongly than the smaller particles; 
this effect results in pillars with core-and-shell cross-sections. Another 
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Figure 3 | Magnetic moulds directing the assembly of single-component 
and multicomponent three-dimensional colloidal structures. a, Profile of 
magnetic field strength above a nickel grid placed on a permanent magnet. 

b, Analogous profile of magnetic field strength but between two aligned nickel 
grids placed between two permanent magnets. c, Large-area SEM image of an 
array of posts formed by 2.5-lym and 800-nm diamagnetic particles. Inset: 
magnified top view. d, Confocal image of the posts shows that larger particles 
(green) localize mostly to the core of each post, whereas smaller particles (red) 
form an outer layer. This core-and-shell arrangement is confirmed by the inset 
SEM image of a post that was cut with a razor blade at approximately 
half-height. e, Diamagnetic particles 2.5 um in size assemble into 
three-dimensional bridges. Inset: top view of one of the pillars. f, The same 
particles assembled into structures inspired by the architecture of the Blue 
Mosque (lower inset). For the architecture of the underlying nickel grids, see 
Supplementary Information and Supplementary Fig. 10. g, h, ‘Fabergé eggs’ 
formed between two nickel grids from 1.2-j4m diamagnetic particles. The two 
structures differ in the amount of particles assembled. All particles used here 
were -NH, functionalized, and before imaging the assembled structures were 
fixed by the addition of glutaraldehyde (50 ul of 25% solution per 1 ml of 
Ho(NO3)3 solution). Scale bars, 40 um. 


effect is illustrated in Fig. 3e, where excess non-magnetic particles connect 
nearby posts along ‘arcs’ in which magnetic forces are balanced by gravity 
(that is, these particles are magnetically ‘levitated’ above high-field 
nickel regions; for calculations see Supplementary Fig. 10). The most 
sophisticated structure, shown in Fig. 3f, is inspired by the architecture 
of the famous Blue Mosque in Istanbul and features larger ‘domes’ con- 
nected by arcs and surrounded by four smaller and unconnected sate- 
llite ‘domes’. The heights of the copulas and the arrangement of arcs 
reflect the varying widths of different portions of the underlying nickel 
grid (and thus various field strength and forces acting on the assem- 
bling particles at different locations; see Supplementary Fig. 10). These 
examples illustrate that we can create intricate three-dimensional struc- 
tures, but they are spatially limited by the decay of the magnetic field 
away from the grid. This constraint can be overcome by aligning two 
nickel grids that are each placed on a magnet, thereby modulating the 
magnetic field through the entire Ho(NO3); solution layer between the 
grids (see Figure 3b). In such fields, non-magnetic particles assemble 
according to the field profiles into egg-shaped structures that connect 
the two grids (Fig. 3g, h). The same principles guide the assembly of 
three-dimensional colloidal molecules when using grids with smaller 
periodicities that are commensurate with particle sizes. Figure 4 shows 
examples of such structures, including single-component (for example 
A,4-A;) and binary (for example A,B and ASB) clusters of diamagnetic 
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Figure 4 | Assembly of three-dimensional colloidal molecules. 

a, b, d, Single-component ‘molecules’ of diamagnetic particles: triangular, Ay 
(a), square, A; (b) and pentagonal, Ag (d), pyramids made of 1.2-|1m particles. 
c, Tetrahedra, Ay, made of 2.5-\1m particles. e, Particles 2.5 [um in size 
assembled into A, structures. f, g, Multicomponent assemblies of diamagnetic 
particles: A,B (f) and A;B pyramids (g); A particles (green) are 1.2 um, 

and B (red) are 800 nm. Colour coding in the confocal images in 

(a-e) corresponds to height; in f and g different colours represent different 
types of fluorescently labelled particle. Insets show SEM images. Scale bars, 

2 um. Fidelities for all assemblies were f> 80%. In single-component 
structures, all particles were functionalized with polyethylenimine 

(PEI; molecular mass 25 kDa) such that they bound to the isocyanate- 
functionalized grid surface and also to one another, by chain entanglement”. 
These polymer-polymer interactions promoted attachment of the top-layer, 
lower-trapping-stiffness particles (see Supplementary Fig. 3a). 

For multicomponent structures, planar clusters were first assembled from 
amine-modified particles forming carbamide bonds with the substrate; they 
were subsequently modified with isocyanate groups to which the top-layer 
PEI-functionalized particles of another type then bound. Once made, all 
structure types were permanently fixed by crosslinking amine functionalities 
with glutaraldehyde (50 pl of 25% solution per 1 ml of Ho(NOs); solution; for 
further details see Supplementary Information). 


particles assembled over micrometre-sized islands. In all cases the assem- 
blies form reliably over large areas with fidelities better than 80%, which 
significantly exceeds the yield achieved with other recent strategies*'*”’ 
for creating colloidal molecules (see Supplementary Fig. 11). 

The magnetic moulds can manipulate objects other than colloids. 
Perhaps the most striking example is shown in Fig. 5a, in which live 
Staphylococcus aureus bacteria in Ho(NO3); at a concentration of less 
than 0.3 M (see also Supplementary Fig. 12) are positioned onto the 
voids ofa square grid. The hybrid structure in Fig. 5b comprises smaller 
live bacteria forming halos around larger non-magnetic colloids centring 
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on the grid’s voids (see the size-dependent effect discussed above and 
also Supplementary Fig. 6c, d). These circular bacterial colonies can 
be made permanent by means of glutaraldehyde crosslinking and can 
be liberated intact into solution (with bacteria still alive). The virtual 
moulds can also manipulate collections of objects with dimensions 
much smaller than the nickel microgrids: Fig. 5c and Fig. 5d show gold 
nanoparticles and gold nanorods, respectively, placed onto the voids 
of larger nickel grids. 

We can also exploit the impact of magnetic microgradients on the 
spatial distribution of ionic species in the paramagnetic liquid: in these 
experiments we achieve the selective precipitation of gold salt from a 
solution of HAuCl1,*3H,O in Ho(NO3); in the low-field regions, either 
above an array of macroscopic magnets (Fig. 5e; see also Supplementary 
Information and Supplementary Fig. 13) or over the microscopic voids 
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Figure 5 | Magnetic positioning of live bacteria, nanoparticles and ions. 

a, SEM image of S. aureus (~0.7 [1m in diameter) assembled onto 1.5-1m voids 
of a square nickel/PDMS grid. Inset: corresponding confocal image with 
bacteria fluorescently labelled (green fluorescence indicates that bacteria are 
alive; see Supplementary Information). b, Confocal and SEM (inset) images of 
bio-colloidal hybrids comprising S. aureus ‘halos’ around 2.5-\1m colloidal SiO, 
particles on a square nickel grid with 4-j1m circular voids. Scale 

bars, 3 jum (a,b). c, d, SEM image of 5.5-nm gold nanoparticles (c) and 
bright-field microscopy image of gold nanorods 40 nm long (d) assembling 
onto the voids of a square nickel grid. Scale bars, 100 um (50 nm in the insets). 
e, Right: optical image illustrating localized precipitation of Au** ions onto a 
glass slide placed above an array of macroscopic permanent magnets (left; see 
also Supplementary Fig. 14). Scale bar, 4mm. f, SEM image illustrating 
localized precipitation of Au’ ions onto the voids of a square nickel/PDMS 
microgrid. g, Corresponding elemental map for gold recorded by 
energy-dispersive X-ray spectroscopy (on SEM $U8030). Scale bars, 50 um 
(50 nm in the insets). h, Illustration of ionic concentration gradients leading to 
the localized precipitation of diamagnetic ions. The paramagnetic Ho** ions 
(purple) are concentrated in the centre area between the two magnets (at the 
maximum of the magnetic field), and the diamagnetic Au’* ions (yellow) are 
expelled from these high-field regions. The specific concentrations of Au’* and 
Ho** ions shown were determined experimentally by ICP-AES experiments. 
The red curve shows the magnetic profile calculated in COMSOL. 


of the nickel/PDMS grid (Fig. 5f, g). Small solution aliquots sampled at 
different locations and analysed by inductively coupled plasma atomic 
emission spectroscopy (ICP-AES) show that Ho(NO3); is concentrated 
in the high-field regions and HAuCl, is concentrated in the low-field 
regions, where it supersaturates and precipitates (Fig. 5h and Supplemen- 
tary Fig. 14). These observations contrast with the belief**”* that external 
magnetic fields change only the field inside the fluid but not the salt 
concentrations, and also suggest that magnetic moulding might find 
use in separating metal salts on the basis of their different magnetic 
susceptibilities (for example the retrieval of uranium from solubilized 
brannerite, UTi.Og, for which x = 1.21 X 10° * for uranium(IV) oxides, 
in contrast with 7 = 3.93 X 10° for titanium(IV) oxide). 

One promising avenue for further development of our method is the 
miniaturization of the system to nanoscopic dimensions (using, for 
instance, nanohole” nickel grids) so that individual nanoobjects can 
be manipulated, although this will probably require stronger fields to 
overcome the effects of thermal noise. We also expect that the use of 
less cytotoxic magnetic fluids (such as the chelated organic gadolinium 
complexes that are used. as MRI contrast agents such as gadoterate meg- 
lumine with y = 1.60 X 10 *) would permit wider uses of the method 
for positioning and manipulating cells. 


METHODS SUMMARY 


Nickel grids were fabricated by standard photolithography, dissolution of the exposed 
photoresist regions, e-beam evaporation of the 200-nm nickel layer, and lift-off of 
the masking photoresist by sonication. A layer of PDMS about 300-1,000 nm 
thick was then spin-coated for 45 s at 5,000 r.p.m. onto the nickel grid to yield a 
flat composite structure. After curing of the PDMS at 65 °C overnight, its surface 
was functionalized with 3-(triethoxysilyl)propylisocyanate (TPI) by immersion 
for 1 hina 10% TPI solution in acetonitrile at 70 °C. This functionalization allowed 
the subsequent covalent attachment of amine functionalized particles. The part- 
icles used in this study were either polymeric or silica particles purchased from 
Bangs Laboratories and were fluorescently labelled with either fluorescein isothio- 
cyanate (FITC) or rhodamine isothiocyanate (RITC). Superparamagnetic particles 
had magnetic susceptibilities ‘eatt =~ 0.170-0.392, whereas diamagnetic particles 
had susceptibilities ne ~—1.17 X 10°. These particles were suspended in a 0.2- 
0.4M solution of the paramagnetic salt Ho(NO3)3 in DMSO/H30. To prevent 
aggregation of the particles by van der Waals forces, the suspensions were always 
refractive-index-matched by the addition of an appropriate amount of DMSO/ 
H,0 (75% DMSO (w/w)). Particles that assembled and attached to the PDMS were 
bonded to one another in a Stdber solution (50 ul of tetraethyl orthosilicate per 
1 cm” of wafer area); after 2 h they were liberated into solution by sonication for 10s. 
Simulations of the magnetic fields above nickel grids were performed using COMSOL 
Multiphysics software with relative magnetic permeabilities of air (/4-(air) = 1) and 
of nickel ({4;(nicket) = 200), and with magnetization of the permanent magnet equal 
to 105,000 Am’. 
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Silicate liquids play a key part at all stages of deep Earth evolu- 
tion, ranging from core and crust formation billions of years ago 
to present-day volcanic activity. Quantitative models of these pro- 
cesses require knowledge of the structural changes and compression 
mechanisms that take place in liquid silicates at the high pressures 
and temperatures in the Earth’s interior. However, obtaining such 
knowledge has long been impeded by the challenging nature of the 
experiments. In recent years, structural and density information for 
silica glass was obtained at record pressures of up to 100 GPa (ref. 1), 
a major step towards obtaining data on the molten state. Here we 
report the structure of molten basalt up to 60 GPa by means of 
in situ X-ray diffraction. The coordination of silicon increases from 
four under ambient conditions to six at 35 GPa, similar to what has 
been reported in silica glass'*. The compressibility of the melt after 
the completion of the coordination change is lower than at lower 
pressure, implying that only a high-order equation of state can 
accurately describe the density evolution of silicate melts over the 
pressure range of the whole mantle. The transition pressure coin- 
cides with a marked change in the pressure-evolution of nickel 
partitioning between molten iron and molten silicates, indicating 
that melt compressibility controls siderophile-element partitioning. 

Density measurements of silicate melts are much sought after, 
because density is the primary factor controlling the evolution of mag- 
mas at depth. Pioneering results have been obtained using shock-wave* 
and ex situ sink—float experiments*® at upper-mantle conditions, de- 
monstrating that basalts are more compressible than their crystalline 
counterparts, which implies that density inversion must occur at depth. 
This behaviour can be explained by the different compression mecha- 
nisms available in liquid silicates, compared to the crystalline structures, 
including the collapse of ring structures at low pressure and gradual 
coordination changes. A key point to be investigated is how and where 
the transition from fourfold- to sixfold-coordinated Si occurs in mag- 
mas at depth and, in turn, how the transition affects melt compressi- 
bility. Structural data has been obtained on molten silicates using 
X-ray diffraction and large-volume presses’; but they are limited below 
10 GPa, that is, below the expected transition. Only shock-wave experi- 
ments have been able to probe the properties of magmas at deep- 
mantle conditions, with data up to 127 GPa reported for molten 36% 
anorthite-64% diopside*. However, these data are difficult to interpret 
in terms of an isothermal equation of state’®. 

We collected in situ X-ray diffraction data in laser-heated diamond 
anvil cells at the extreme-conditions beamline P02.2 at the PETRAIII 
synchrotron (the Positron-Electron Tandem Ring Accelerator III Deu- 
tsches Elektronen-Synchrotron) in Hamburg, Germany. The choice of 
the basaltic sample was made for the following reasons: (1) basalt is the 
most commonly produced magma, (2) its SiO, content (50.2 weight 
per cent) is high enough that the Si-O coordination change can be 
tracked from structural data, (3) its FeO content (7.81 weight per cent) 
is sufficient to couple with the infrared laser without the need to add a 


metallic coupler, (4) its melting curve and petrological relationships 
are well documented in our targeted pressure-temperature range”. 
The observation of diffuse rings with the simultaneous absence of any 
crystalline diffraction peaks apart from those of the SiO, pressure- 
transmitting medium was used as the criterion to determine melting. 
Apart from the lowest-pressure run, which was quenched as glass, all 
melts fully recrystallized on quenching at room temperature (20 °C). 
Their mineralogical composition as identified from X-ray diffraction 
data agrees with previous petrological studies'*"', and is reported in 
Table 1 along with experimental pressure-temperature conditions. 
The intensity data (Fig. 1a) are converted into the structure factor 
S(q), where q is the scattering vector, using the Ashcroft-Langreth 
formalism. The radial distribution function g(r) (Fig. 1b) is obtained 
by Fourier transforming of S(q). Changes of intensity occur gradually 
up to 35 GPa, with the first sharp diffraction peak shifting to higher q 
values and decreasing in intensity, interpreted as a collapse of the SiO, 
tetrahedral network, a second diffuse peak growing at 3.3 A anda 
relative decrease of the intensity of the third diffuse peak at 4.5 A7'. 
These changes are expressed in g(r) through a shift of the first contri- 
bution to slightly higher distances, and by a broadening of the second 
main contribution. These changes are very similar to those observed**” 
in silica glass, and occur over the same pressure range, that is, between 
10 GPa and 35 GPa. The shift of the first contribution in g(r) to higher 
distances reflects increased coordination of the Si-O contribution. The 
shouldering of the second contribution at 2.5 A has been related to O-O 
interactions as neighbouring polyhedra participate in edge-sharing’. 
The radial distribution functions obtained are the sum of all ion-ion 
contributions. To quantify the increase of Si-O coordination number, 
we have simulated the experimental g(r) against a sum of Gaussians (see 
Methods), assuming that the Al-O coordination number is 6 for pres- 
sure exceeding 15 GPa, as is shown to be the case in the glassy state’’. 
The fitted Si-O interatomic distance dsj_o and coordination number 
are reported in Fig. 2. Si-O coordination increases from 4.5(0.2) at 
15 GPa to 6.0(0.1) at 35GPa. Concomitantly, ds;_o increases from 
1.62(0.01) Ato 1.66(0.02) A. (Note that parentheses indicate lo errors.) 
In the quenched samples, sixfold-coordinated Si-bearing phases appear 


Table 1 | Experimental pressure-temperature conditions and results 


Run P (GPa) T(K) Measured density Observed quenched products 
(kgm *) 

i 10.5 2,200 3,611 +69 Glass 

2 15 2,273 3,821 +70 st, cpx, gt 

6 23.7 2,473 Not determined st, gt 

5 24.6 2,573 4,313 +52 st, gt, pv, NAL, cf 

13 35 2,700 4,523 + 53 st, Ca-pv, Mg-pv, NAL, cf 

8 45 2,973 4,750+71 st, Ca-pv, Mg-pv, cf 

12 60 3,273 4,978 +70 st, Ca-pv, Mg-pv, cf 


Quenched products were identified on the X-ray diffraction patterns obtained after quenching laser 
heating. Mineral phases were stishovite (st), clinopyroxene (cpx), garnet (gt), calcium perovskite (Ca- 
pv), magnesium perovskite (Mg—pv) and new aluminium-rich phase (NAL). The calcium-ferrite phase 
(cf) is ascribed to an Al-bearing phase with stoichiometric formula [Na,Ca,Mg,Fe];[Al,Si,Fe,Mg]20a. 
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UMR-CNRS 7193, Institut des Sciences de la Terre Paris, F-75005 Paris, France. 7DESY Photon Science, Notkestrasse 85, D-22607 Hamburg, Germany. “Faculty of Earth and Life Sciences, Vrije Universitat 
Amsterdam, 1081 HV, The Netherlands. Institut ftir Geowissenschaften, Goethe-Universitat Frankfurt, D-60438 Frankfurt am Main, Germany. 
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Figure 1 | Structure of molten basalt at high pressures. a, Intensity scattered 
by molten basalt at high pressure. FSDP, first sharp diffraction peak. The dotted 
lines are the patterns obtained after subtraction of the background signal (see 
Methods), and contain contribution from the melt and diffraction Bragg peaks 
from stishovite, the pressure-transmitting medium. These Bragg peaks have 
been erased in the continuous black data. b, Corresponding radial distribution 
functions. 


at 25 GPa (Table 1). Most of the Si-coordination change in the melt thus 
occurs at lower pressure than in the crystalline phases, but is completed 
only by 35 GPa. First-principles molecular dynamics calculations pre- 
dict a more gradual fourfold-to-sixfold coordination increase in molten 
anorthite and diopside’*”’. Instead, Si-coordination changes in the natural 
basaltic composition are more consistent with calculations on pure 
silica melt'®. Our data also compare favourably with calculations in 
that bond lengths initially increase on compression. 

In SiO} glass, the structural changes are accompanied by a stiffening 
of the glass’"”, In this work, we measured the density of melts using the 
X-ray-diffraction-based method first applied to diamond-anvil cell 
experiments at moderate pressure (ref. 18), and later extended to the 
megabar range’’. It consists of minimizing the oscillations in g(r) 
where there should not be any signal, that is, below the minimum 
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Figure 2 | Pressure evolution of Si-O coordination number and dg; in 
molten basalt. a, Si-O coordination number versus pressure; b, ds;9 versus 
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interatomic distance. However, this method requires that the back- 
ground, essentially the Compton signal from the diamond anvils that 
dominates the total diffracted intensity, is perfectly subtracted (see 
Methods). The density values we obtained (Fig. 3) fall within the 
error bars on the shock-wave isentrope measured for the model 36% 
anorthite-64% diopside composition and applied to a basaltic com- 
position* at upper-mantle conditions. 

At higher pressure, there is a clear deviation from this shock-wave 
isentrope, with a stiffening of the melt that resembles what has been 
observed in other shock-wave studies’, although the absolute density 
values are different. This stiffening is related to the fact that Si is the last 
of the major elements to undergo a coordination increase, so that 
above 35 GPa, bond-length shortening gradually overtakes coordina- 
tion change as the dominant compression mechanism. Consequently, 
our density data can be fitted only with a high-order equation of state. 
A fourth-order Birch-Murnaghan fit to the data reduced to a common 
isotherm (2,735 K; see Methods) is obtained for the following values 
of the isothermal bulk modulus (Ky. 9) and its pressure derivatives (K’ 
and K"): Ko735x,0 = 24.0 + 1.7 GPa, K'2735K,0 = 0.664 + 0.032 and 
K"3.735K,0 = —0.057 + 0.057 GPa‘, using p2735K.9 = 2.480kgm * 
as a regular point®. This equation of state is remarkably close to that 
obtained theoretically for anorthite melt* (Fig. 3), also a fourth-order 
finite strain expansion, and slightly different to that of the less poly- 
merized diopside melt’’. The latter was a third-order fit, as for other 
less polymerized melts”, whereas a fifth-order expansion is necessary 
to represent pure SiO, liquid’®. The presence of Fe in our composition 
might explain part of the deviation from the shock-wave and theoretical 
studies that were conducted on Fe-free compositions. Fe-rich melts are 
expected to have a higher densification rate than their Mg counterparts 
because the Fe-coordination number starts to increase at lower pressure 
than the Mg-coordination number does in Mg-rich melts”. 

Our data provide the first in situ experimental insight into the be- 
haviour of a magma ocean of unknown and evolving composition. In 
the context of the crystallization of a magma ocean, the difference in 
density between melts and crystals will be at its highest at the bottom of 
the upper mantle, that is, above the fourfold-to-sixfold Si-coordination 
transition in crystals. At around 25 GPa, the crystalline silicate phase is 
denser than the melt because Si is fully converted into sixfold coordi- 
nation in the former. Silicate melts should also be denser than the solid 
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pressure. Present data are compared to theoretical calculations on silicate 
melts'*'*, and to experimental data on SiO, glass**’’. Errors, lo. 
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Figure 3 | Density of molten basalt as a function of pressure. Density of 
molten basalt determined here, and compared to previous sink-float data>®. 
Also shown are the seismological density profile for the Earth’s mantle’, the 
theoretically derived equations of state for molten anorthite’ and diopside’ at 
3,000 K applied to basalt using its ambient pressure density, the shock-wave 


phase at the bottom of the lower mantle and will accumulate on top of 
the core. We should therefore expect a layered magma ocean with a 
crystalline layer separating an upper magma ocean from a basal one, as 
posited by thermal evolution models”. On the basis of previous sink- 
float measurements, it has been proposed that deep melting in the early 
Archaean Earth generated negatively buoyant melts that would accu- 
mulate on top of the 410-km discontinuity”. These Fe-rich melts 
would have sunk into the lower mantle as they crystallized to litholo- 
gies that are denser than ambient mantle, although that would depend 
on Fe partitioning at depth. The present measurements extend the 
negatively buoyant zone for mafic melts down to the 660-km discon- 
tinuity (Fig. 3). 

Strikingly, the reported structural and compressibility changes coin- 
cide with a marked change in the pressure evolution of the distribution 
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Figure 4 | Pressure evolution of nickel partitioning coefficient between 
metal and silicate melts. Dis defined as the concentration by weight of nickel 
in molten metal divided by its concentration in a coexisting silicate melt. Data 
are from refs 24-29. Below 35 GPa, the data plot as a downward-sloping 
straight line (black) from which they deviate at higher pressures. Errors, lo. 
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isentrope equation of states measured on a model 36% anorthite-64% diopside 
composition and applied to basaltic composition’, and the shock-wave-derived 
isothermal equation of state for a 36% anorthite-64% diopside composition®. 
BM, Birch-Murnaghan equation of state; BM4, 4th-order BM. Errors, lo. 


of nickel between metal and silicate melts, as expressed by the nickel 
partition coefficient D’ (Fig. 4). The depth of a magma ocean is cla- 
ssically estimated based on the measured pressure-dependence of side- 
rophile elements partitioning between molten silicate and metal. Recent 
estimates**”° exceed 30 GPa and range up to 60 GPa, with most models 
relying on extrapolations of experimental data restricted to lower pres- 
sures (<25 GPa). The coincidence between the change in silicate melt 
compressibility and nickel partitioning implies that data obtained below 
35 GPa should not be used to constrain partitioning at deeper mantle 
pressures. We reported a similar coincidence between change in the 
pressure evolution of siderophile-element partitioning and the com- 
pressibility change in molten Fe alloys at 5 GPa (ref. 30). Partitioning of 
an element between two phases is controlled by its coordination geo- 
metry and site compressibility in each phase. At ambient pressure, 
nickel is in tetrahedral sites*’, but it may occupy more octahedral sites 
as pressure increases”. This results in an increased crystal field stabi- 
lization energy of these ions in the silicate melt, and stabilizes them in 
the silicate relative to the metal phase, which explains the initial decrease 
of D™' with increasing pressure. 

It is unlikely that Ni coordination further increases after completion 
of the fourfold-to-sixfold coordination transition in Si. The flattening 
observed around 35GPa may rather be explained by a change in 
site compressibility. Silicate melt stiffening implies a convergence of 
site compressibility between silicate and metal melts, resulting in a 
weak pressure-dependence of D™' above 35 GPa. An unfortunate con- 
sequence of this weak pressure-dependence is that error bars on 
magma ocean depths need to be revised upwards, if they can be deter- 
mined at all. These examples illustrate that models of the magma ocean 
era, including segregation of the molten Fe core and crystallization 
of the magma ocean, must incorporate pressure-induced structural 
changes in melts. 


METHODS SUMMARY 


We used symmetric diamond-anvil cells equipped with 70°-opening Boehler- 
Almax seats to access a wider q range up to9 A! and reduce the diamond Comp- 
ton contribution, given that Boehler-Almax anvils are only 1.5mm thick. The 
X-ray monochromatic beam (42 keV) was focused down to a size of 2 um X 4 jum, 
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whereas the double-sided infrared laser was focused down to 20 um. The starting 
glass sample (Extended Data Table 1) was loaded as a 20-1m-thick platelet 
between two equally thick platelets of compressed SiO, powder. The SiO, platelets 
act as thermal insulators and as the pressure-transmitting medium. To limit iron 
migration away from the laser heating spot that is due to the Soret effect, the laser 
shutters were opened only once the targeted power was reached, and held open for 
10s during which ten X-ray diffraction patterns of acquisition time 1 s each were 
recorded on a Perkin-Elmer 2D detector. Targeted power was increased in 2-W 
increments from 20 W to 40 W of power on each laser depending on pressure until 
complete melting of the sample occurred. Only one sample was used per pressure 
point to avoid repeated laser-heatings and to preserve the chemical integrity of the 
sample. To isolate the scattered intensity from the molten basalt only, each sample 
was removed from the gasket, and the gasket put back in place to collect X-ray data 
on the empty cell. Obtained patterns were then scaled vertically to match the 
baseline of X-ray patterns collected on the quenched crystalline sample still under 
pressure. This last step ensures that any pressure effect on the background is 
corrected for. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


We used symmetric diamond-anvil cells equipped with 70°-opening Boehler- 
Almax seats to access a wider q range up to 9A~ and reduce the diamond 
Compton contribution, given that Boehler-Almax anvils are only 1.5 mm thick. 
The X-ray monochromatic beam (42 keV) was focused down toa size of 2 [um X 4 |tm, 
whereas the double-sided infrared laser was focused down to 20 jum. The starting 
glass sample, a basalt obtained by grinding and fusing in air a Stromboli pumice™ at 
1,400 °C, was loaded as a 20-|1m-thick platelet between two equally thick platelets of 
compressed SiO, powder. The SiO, platelets act as thermal insulators and as the 
pressure-transmitting medium. To limit the iron migration away from the laser 
heating spot that is due to the Soret effect, the laser shutters were opened only once 
the targeted power was reached, and held open for 10s during which ten X-ray 
diffraction patterns of acquisition time 1s each were recorded on a Perkin-Elmer 
two-dimensional detector. Targeted power was increased in 2-W increments from 
20 W to 40 W of power on each laser depending on pressure until complete melting 
of the sample occurred. Only one sample was used per pressure point to avoid 
repeated laser-heatings and to preserve the chemical integrity of the sample. To 
isolate the scattered intensity from the molten basalt only, each sample was removed 
from the gasket, and the gasket put back in place to collect X-ray data on the empty 
cell. The patterns obtained were then scaled vertically to match the baseline of X-ray 
patterns collected on the quenched crystalline sample still under pressure. This last 
step ensures that any pressure effect on the background is corrected for. 

Some samples recovered after the experiments were embedded in epoxy and 
polished for electron microprobe analysis at the EMMAC centre of the University 
of Edinburgh (see Extended Data Table 1). We used a defocused beam size of 
10 um to get an average composition at the laser-heated spot. Conditions were 
15-keV voltage and 2-nA current for the analysis of K, Ca, Na, Mg, Al, Fe and Si, 
and 15keV and 80nA for the analysis of Ti, Mn and P. Electron microprobe 
analysis shows that no detectable contamination of the sample by the SiO, platelets 
occurred; silica (being Fe-free) is heated only conductively by the laser-heated 
basalt, and for a short duration (10s), so potential chemical exchanges are limited. 

A ruby sphere was added in the chamber for pressure measurements at room 
temperature using the ruby fluorescence method*. Two-dimensional patterns 
were integrated using the Fit2D software’’. Because we used the off-axis heating 
system to avoid using carbon mirrors that would add to the background signal, 
temperature could not be measured by pyrometric techniques. It was instead 
assumed that the temperature followed the melting curve of basalt as determined 
up to 60 GPa (ref. 9). Using these temperature values, pressure at high temperature 
was calculated using the equation of state of stishovite’’”. 

To determine the Si-O coordination number, the experimental g(r) was simu- 
lated against a sum of Gaussians (where each Gaussian corresponds to a particular 
ion-ion partial distribution function) with the following equation: 
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where CN; is the coordination number of the ith ion-ion contribution, d; is the 
corresponding interatomic distance and o; is a parameter depending on structural 
disorder®’, o; =ky/d;; k is an adjustable parameter (typically 0.15-0.17). 

The Al-O coordination number is assumed to be six for pressure exceeding 
15 GPa, as has been shown to be the case in the glassy state’. First-principles 
molecular dynamics calculations, however, predict a full conversion to sixfold 
coordination for the anorthite melt at 3,000 K at 35 GPa only", with coordination 
numbers of 5.47 and 5.8 at 15 GPa and 25 GPa, respectively. If we use these values, 
then our fitted coordination number for Si-O should be increased by 0.18 at 
15 GPa and 0.07 at 25 GPa. 

To fit our density data with an equation of state, we first reduced the data to a 
common isotherm as the data were collected along the melting curve. In the 
absence of data on the thermal expansion coefficient « for basalt at high pressure, 
we have calculated « assuming that «Ky is a constant, and chosen the mean 
experimental temperature T as the appropriate isotherm. The assumption that 
aK is constant is not true, as shown by first-principles calculations, and so we 
overestimate at high pressure. However, this gives a maximum limit on the 
thermal correction that needs to be applied to our data. 
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Extended Data Table 1 | Chemical composition of the starting sample 
and recovered samples 


Oxide — Starting composition Recovered samples 


Ref.* (mean of 3 analyses) 

KO 1.86 (0.06) 1.52 (0.42) 
CaO 12.4 (0.2) 11.60 (0.29) 
Na2O 2.29 (0.05) 1.88 (0.55) 
MgO 7.87 (0.12) 8.03 (0.12) 
Al2O3 15.2 (0.3) 15.06 (0.58) 
FeO 7.81 (0.33) 7.23 (0.43) 
SiOz 50.2 (0.3) 49.97 (0.79) 
P205 0.64 (0.06) 0.41 (0.04) 
TiO2 0.84 (0.04) 0.76 (0.04) 
MnO 0.16 (0.05) 0.12 (0.05) 
Total 99.27 96.58 


One standard deviations are given in parentheses. 
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Mating advantage for rare males in wild guppy 


populations 


Kimberly A. Hughes’, Anne E. Houde’, Anna C. Price® & F. Helen Rodd? 


To understand the processes that maintain genetic diversity is a 
long-standing challenge in evolutionary biology, with implications 
for predicting disease resistance, response to environmental change, 
and population persistence’*. Simple population genetic models 
are not sufficient to explain the high levels of genetic diversity some- 
times observed in ecologically important traits”. In guppies (Poecilia 
reticulata), male colour pattern is both diverse and heritable, and is 
arguably one of the most extreme examples of morphological poly- 
morphism known*”. Negative frequency-dependent selection (NFDS), 
a form of selection in which genotypes are favoured when they are 
rare®, can potentially maintain such extensive polymorphism, but 
few experimental studies have confirmed its operation in nature”®. 
Here we use highly replicated experimental manipulations of natural 
populations to show that males with rare colour patterns have higher 
reproductive fitness, demonstrating NFDS mediated by sexual selec- 
tion. Rare males acquired more mates and sired more offspring 
compared to common males and, as previously reported, had higher 
rates of survival®. Orange colour, implicated in other studies of sexual 
selection in guppies, did predict male reproductive success, but only 
in one of three populations. These data support the hypothesis that 
NFDS maintains diversity in the colour patterns of male guppies 
through two selective agents, mates and predators. Similar field-based 
manipulations of genotype frequencies could provide a powerful 
approach to reveal the underlying ecological and behavioural mecha- 
nisms that maintain genetic and phenotypic diversity. 

Populations of organisms exhibit enormous genetic diversity. Explaining 
this diversity has been challenging, particularly when variation occurs 
in traits under strong natural selection’’. High genetic variation in 
ecologically important traits cannot be explained by standard popu- 
lation genetic models that incorporate directional natural selection, 
genetic drift and mutation”. Instead, various kinds of balancing selection, 
in which genotype fitness varies temporally, spatially, or as a function 
of genotype frequency, have been proposed to explain this diversity”. 
However, the ecological mechanisms that generate balancing selection 
are generally unknown’. 

The colour patterns displayed by adult male guppies (Fig. 1) are 
highly heritable, but also highly variable within populations”. Female 
mate choice has been proposed to account for this diversity because 
laboratory studies indicate a strong preference for rare or novel colour 
patterns'°”'”, If this preference occurs in nature, then it would result in 
NEDS on male colouration and promote variation. However, it can be 
difficult to detect NFDS in nature because the equilibrium frequencies 
of different phenotypes under this form of selection are those at which 
fitnesses are equal'*. Consequently, a rare-male advantage will be 
detectable only when phenotype frequencies are perturbed from their 
equilibrium values. 

To determine whether male guppies with rare colour patterns have a 
reproductive advantage in nature, we conducted 17 separate manip- 
ulations in 3 different populations in Trinidad®. Over two field seasons, 
we manipulated frequencies of naturally occurring colour patterns 
within replicate pools in the Mausica River (5 replicates) and in two 


separate tributaries of the Quare River (Quare River 1 (6 replicates); 
and Quare River 7 (6 replicates)). At each site, we sorted males into 
groups that were nearly equal in abundance, based on caudal fin colour, 
as ‘uncoloured’ (>75% of the caudal fin transparent) or ‘coloured’ 
(>50% of fin coloured); males with intermediate colouration were 
excluded from the experiment. Males were then re-introduced into 
pools in a ratio of 3:1 (in which 1 is ‘rare’), with each morph being 
rare in half of the pools. After 16 to 17 days, depending on the site, all 
adults were collected, separated by sex, and identified by pool-specific 
tattoos and pre- and post-release photographs. Adult females (n = 193) 
were taken to the laboratory, where we collected their first two broods 
(2 to 24 offspring per female, mean = 7.6). We genotyped these females, 
166 experimental males, 693 first-brood and 777 second-brood offspring 
at 9 to 14 variable microsatellite loci‘*'’. To avoid confounding repro- 
ductive success with differential survival, only experimental males that 
survived to the end of the experiment, and that did not move between 
pools during the experiment were considered as candidate fathers. 
Using a conservative 95% confidence level for paternity assignment'®, 
the number of offspring assigned to candidate fathers ranged from 0 to 
12, and mating success (the number of females with which a male 
produced at least one offspring) ranged from 0 to 8 females (Sup- 
plementary Table 1). 

Among first-brood offspring, males bearing rare colour patterns 
had more than twice as many mates and offspring as males bearing 
common patterns, based on the paternity assignment (Fig. 2a). Other 
experimental factors such as population, morph (the particular colour 
pattern chosen to be rare or common within a replicate), and popu- 
lation-by-morph interaction did not significantly affect the number of 
mates or offspring (Extended Data Table 1). Males with rare colour 
patterns thus had a reproductive advantage over those with common 
patterns, and this advantage did not depend on population or on the 
specific colour pattern that was rare or common in a given replicate 
(Extended Data Fig. 1). When we relaxed the criterion for paternity 


we OTe 


Figure 1 | Colour pattern variation among males from a single population. 
Male offspring of Quare River 7 tributary fish, reared in a common 
environment and showing heritable colour-pattern variation. Males on the left 
have a caudal fin that is representative of the ‘uncoloured’ group, those on right 
are have a caudal fin that is representative of the ‘coloured’ group. Criteria for 
classification are described in ref. 9. 


1Department of Biological Science, 319 Stadium Drive, Florida State University, Tallahassee, Florida 32306, USA. *Department of Biology, Lake Forest College, 555 North Sheridan Road, Lake Forest, Illinois 
60045, USA. *Department of Ecology and Evolutionary Biology, University of Toronto, 25 Harbord Street, Toronto, Ontario M5S 3G5, Canada. 
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Figure 2 | Rare colour patterns have higher mating and reproductive 
success. a, Number of mates and offspring assigned to common (white, 
n = 124) or rare (dark grey, n = 42) males. Centre values are marginal means 
from the full generalized linear mixed model; bars indicate s.e.m. adjusted for 
model covariance parameters. **P < 0.005 and *P = 0.01, respectively; see 
Extended Data Table 1. b, c, Association between square-root orange area and 
reproductive success in Quare River 7 (solid line, n = 34), Quare River 1 
(dashed line, n = 72) and Mausica River (dotted line, n = 60), predicted from 
the full model. b, Predicted mates. c, Predicted offspring; see Extended Data 
Table 2. Minimum and maximum values along abcissa indicate range of values 
recorded. 


confidence to >85%, the effects of all experimental factors were similar 
to the original analysis (Extended Data Table 2). The results are there- 
fore robust to the specific criterion for paternity assignment. 

We also examined effects of variation in male size (total body area), 
and colour (area of orange and black body colouration, Supplemen- 
tary Table 1) because these traits have been implicated in previous 
studies of sexual selection in guppies”’”. In generalized linear mixed 
models (GLMMs, see Methods), main effects of variation in these traits 
were not significant, but the population-by-orange area interaction 
was associated with both mate and offspring number (Extended Data 
Table 1). In both cases, the significant interaction was due to a positive 
association between reproductive success and orange area that was unique 
to the Quare River 7 population; the association was negative in Quare 
River 1, and close to zero in the Mausica River population (Fig. 2b, c 
and Extended Data Table 3). This population dependence is consistent 
with laboratory studies that document variation in the existence and 
strength of female preference for orange among natural guppy popula- 
tions’. Again, these patterns were robust when we relaxed the criterion 
for paternity assignment (Extended Data Table 2). 

Female guppies give birth to live offspring every 27 to 30 days and 
will fertilize eggs with stored sperm if they have not mated recently*’”. 
To distinguish the effects of recent insemination from long-term sperm 
storage, we separately analysed the second broods produced by females 
following the field experiment. These broods were produced at least 47 
days after the end of the field experiment, and a full reproductive cycle 
after the first broods; they were therefore fertilized by sperm stored for 
at least 47 days because guppies do not carry embryos at multiple stages 
of development’®. For these second broods, several aspects of male 
colouration, but not rarity, significantly influenced male reproductive 
success (Extended Data Table 4). Males with more highly coloured 
caudal fins had more assigned mates and offspring than those with 
little colour, while the number of orange spots on the body was nega- 
tively related to mate number (GLMM; 8 = —1.0+0.5, x? = 4.83, 
d.f. = 1, P= 0.03), and non-significantly related to offspring number 
(GLMM; B = —1.3.+0.6, x? = 3.53, d.f.=1, P=0.06). As in first 
broods, the area of orange body colour interacted significantly with 
population to predict male reproductive success. Again, the interaction 
occurred because orange was positively associated with reproductive 
success in Quare River 7, but not in the other two populations 
(Extended Data Table 5). 
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These results for second broods are consistent with laboratory data 
indicating that male colour can predict sperm competitive ability and/ 
or cryptic female choice!?. Nevertheless, if females mate at least once 
per reproductive cycle in nature, the first-brood data should better 
reflect male success because these data capture the effects of recent 
matings and short-term sperm storage. Two lines of evidence indicate 
that females do mate at least once per cycle: first, a high proportion of 
wild females have recently-deposited sperm in their gonoducts”””’; 
and second, all post-partum females re-mate within 48 h in laboratory 
studies'°’?. These data suggest that nearly all females mate at least 
once (and probably more than once) per cycle. 

Our results support the prediction of the ‘rare-male effect’ hypo- 
thesis that males with phenotypes that are rare in the local environment 
have a reproductive advantage over males with common phenotypes”. 
It is possible that we observed unusually strong NFDS because we 
perturbed morph frequencies in our experiment, and thereby made 
it more difficult to detect selection on other aspects of male pheno- 
types. However, it is precisely this dependence of selection on morph 
frequency that maintains variation. Our results indicate that if colour 
pattern frequencies deviate sufficiently from their equilibrium values, 
NEDS will dominate the evolutionary dynamics. 

Although we cannot be certain that the rare-male advantage we 
observed was caused by female mate preference, abundant evidence 
from laboratory studies indicates that females prefer males with rare 
phenotypes'!’'*”*, Moreover, female preference is more important 
than male-male competition in determining mating outcomes in this 
species”’”. These data indicate that rare-male advantage in the field is 
likely to be mediated by female mate preference. In contrast, male— 
male competition””*”° and sexual conflict”” have been implicated in 
the maintenance of discrete sex-limited polymorphism in other species. 
In each of those cases, the polymorphism involves two or three discrete 
morphs, compared to the much larger numbers of morphs found 
within guppy populations. The relative importance of female preference 
and other kinds of interactions in driving NFDS and in maintaining 
polymorphism is an intriguing direction for future research. 

Despite strong evidence for the rare-male effect in guppies, the evolu- 
tionary processes that account for its prevalence are not known. Mate 
preference for males with unusual colouration might have evolved as a 
mechanism for inbreeding avoidance’”, as a consequence of general- 
ized neophilia'®, or because females avoid remating with previous 
mates and also reject males with colouration similar to that of previous 
mates’. It has been previously proposed”* that a survival advantage to 
rare morphs, as demonstrated in ref. 8, could also drive the evolution of 
mate preference for rare phenotypes, even though rarity itself is not 
heritable. 

Understanding whether, and when, balancing selection maintains 
genetic variation in ecologically important traits is a central challenge 
for modern evolutionary biology, with profound implications for med- 
ical, agricultural and ecological genetics’. Manipulative experiments in 
nature have demonstrated balancing selection in guppies” and other 
species’”’. In guppies, these experiments have been powerful enough 
to reveal multiple ecological mechanisms that contribute to balancing 
selection and to reveal directional and balancing selection that act on 
the same phenotypes. We suggest that manipulations in natural popu- 
lations will be useful for discovering the ecological processes that main- 
tain other polymorphisms including those, like the vertebrate major 
histocompatibility complex, that have been refractory to other approaches”. 


METHODS SUMMARY 


At each site, we used pools of similar size, structure, substrate and water clarity to 
form replicate experimental pools*. After collecting all adult guppies from each 
experimental pool, all fish were photographed and given pool-specific elastomer 
tattoos before being used in the experiment. We also kept density and sex ratio 
close to natural levels. All procedures complied with animal care standards of the 
Canadian Council on Animal Care and were approved by University of Toronto’s 
Animal Care Committee. 
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The candidate fathers were the 166 marked experimental males that survived to 
the end of the experiment (60 from Mausica River, 72 from Quare River 1, and 34 
from Quare River 7). We used CERVUS 3.03 (ref. 16) for paternity assignment. 
For 135 first-brood offspring, paternity was assigned to an experimental male with 
>85% confidence; 93 were assigned with >95% confidence (Supplementary 
Tables 1 and 2). For the rest, the difference in log-likelihoods between the first 
and second most likely fathers indicated <85% probability that the most likely 
father was the true father, and these paternity assignments were not used in our 
analysis. For second broods, 158 offspring were assigned with >85% confidence 
and 102 with >95% confidence. We fit generalized linear mixed models to counts 
of assigned mates and offspring, using a negative binomial distribution and 
Laplace approximation for estimating the marginal likelihood. Pools nested within 
populations were modelled as random effects. Fixed effects included population, 
treatment (rare versus common), morph (coloured versus uncoloured caudal fin), 
body size (area of a two-dimensional image, excluding all fins), body area covered 
by orange, number of discrete orange and black spots, and all two-way interac- 
tions. Area covered by black and three-way interactions were also tested, but never 
approached significance (all P > 0.1) and were not included in the final models. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Field experiment. The sections of the Mausica River, Quare River 1 and Quare 
River 7 streams that we used had a pool-and-riffle structure that naturally limits 
migration between pools, particularly during the dry season, which is the time of 
year when our experiments were conducted*!. We also built temporary barriers 
between pools to reduce migration further. At each site, we used five to six pools of 
similar size, structure, substrate and water clarity to form replicate experimental 
pools®. We collected all adult guppies from each pool and sorted males into two 
groups that were nearly equal in abundance within sites, based on classification of 
caudal fin colour morphs as ‘uncoloured’ (>75% of the caudal fin transparent) and 
‘coloured’ (>50% of fin coloured); males with intermediate amounts of colour 
were excluded from the experiment. Fish were photographed and given pool- 
specific marks using a small elastomer injection. Males were reintroduced into 
pools in a ratio of 3:1 with each morph as the rare type in half the pools. Females 
that were used in the experiment were always reintroduced into the same pool 
from which they had been collected. We controlled for the density and size of the 
pools by maintaining the original, or slightly reducing, the numbers of adults that 
had been in the pools; we also maintained the natural sex ratio. Detailed methods 
and the number of males of each type and the number of females reintroduced into 
each pool are described in ref. 8. All procedures complied with animal care stan- 
dards of the Canadian Council on Animal Care and were approved by University 
of Toronto’s Animal Care Committee. 

Experimental males. We genotyped 166 marked experimental males that sur- 
vived to the end of the experiment (60 from Mausica River, 72 from Quare River 1, 
and 34 from Quare River 7); these were the recaptured ‘rare’ and ‘common’ males 
described in ref. 9. Of these, 124 were classified as common (43 in Mausica River, 
53 in Quare River 1, and 28 in Quare River 7) and 42 were classified as rare (17 in 
Mausica River, 19 in Quare River 1, and 6 in Quare River 7). We also genotyped 
193 adult females that survived the field experiment (55 from Mausica River, 83 
from Quare River 1, and 55 from Quare River 7), 693 first-brood offspring of these 
females, and 777 second-brood offspring. Animals from Quare River 1 and Mausica 
River were from the field experiment that was conducted in 2003, and animals 
from Quare River 7 were from the field experiment that was conducted in 1999. 
Sixty-one other males caught at the end of the experiment were unmarked and 
presumed to have matured during the experiment or to have migrated into the 
study area. Eight marked males migrated between pools during the experiment. 
We genotyped all mature males that were collected at the end of the experiment so 
that offspring could be assigned to them, but we did not include the unmarked 
males or the eight migrant males in subsequent analyses because they were present 
in the experimental pools or were sexually mature for an unknown period of time. 
In addition, 58 marked males did not survive to the end of the experiment’, These 
males were not genotyped and therefore not included in the paternity analysis. 
Paternity analysis. We genotyped 14, 11, and 9 loci, respectively, from fish of the 
Quare River 1, Quare River 7 and Mausica River populations, which had a mean 
number of 11, 16.2 and 11.6 alleles per locus. The individual who scored genotypes 
was blind to the experimental treatment group of the individuals. For paternity 
analysis, we used CERVUS 3.0.3 to simulate 100,000 offspring, assuming a mistyping 
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rate of 0.02, that 80% of candidate fathers were genotyped, a minimum of 5 loci 
typed, and a strict confidence level of 95%. The combined exclusion probabilities 
were 3.8 X 10°? in Quare River 7, 3.9 X 10° in Quare River 1, and 4.7 X 10° *in 
Mausica River. To assign paternity to a male, the difference in LOD (logarithm of 
odds) scores between the most likely candidate father and the second most likely 
candidate father (delta) had to exceed the 95th percentile of values produced from 
100,000 simulations. 

For 424 first-brood offspring, the most likely father was an experimental male; 
for the remainder the most probable father was a migrant or was unassigned. For 
135 offspring, paternity was assigned to an experimental male with >85% con- 
fidence; 93 were assigned with >95% confidence. For the rest, the difference in log- 
likelihoods between the first and second most likely fathers corresponded to <85% 
probability that the most likely father was the true father, and these paternity 
assignments were not used in subsequent analyses. For second broods, 435 off- 
spring were assigned to experimental males, 158 of these were assigned with >85% 
confidence and 102 with >95% confidence (Supplementary Table 2). 
Morphological measurements. Images were measured using ImageJ”. Areas were 
measured by outlining the region with the freehand tool. The area of the entire 
body, excluding the fins and gonopodium, was measured. Individual coloured 
spots were each outlined and measured separately. These measures of area were 
square root transformed before analysis to conform to model assumptions. For 
three males (two in Quare River 7 and one in Mausica River), pictures taken in the 
field were of such low quality that reliable morphological measures could not be 
obtained. These males were excluded from the analyses of quantitative traits, but 
were included in the analyses that only considered experimental treatment factors. 
Statistical analysis. We fit generalized linear mixed models to counts of assigned 
mates and offspring, using a negative binomial distribution, and Laplace approxi- 
mation for estimating the marginal likelihood. Replicate experimental pools 
nested within populations were modelled as random effects. Fixed effects included 
population, treatment (rare versus common), morph (coloured versus uncoloured 
caudal fin), body size (area of a two-dimensional image, excluding all fins), body 
area covered by orange, numbers of discrete orange and black spots, and all two- 
way interactions. Body area covered by black, and three-way interactions were also 
tested, but never approached significance (all P > 0.1) and were not included in the 
final models. All analyses were conducted in SAS 9.3 (ref. 33); generalized linear 
mixed models were implemented in Proc Glimmix. 

GPS coordinates of sites. The natural populations used in these experiments 
were located at: Mausica, PS 685749 1176906; Quare 1, PS 696972 1180687 (called 
Quare 2 in ref. 34); Quare 7, PS 697407 1179935. 


31.  Reznick, D.N., Butler, M. J. IV., Rodd, F. H. & Ross, P. Life-history evolution in 
guppies (Poecilia reticulata): 6. Differential mortality as a mechanism for natural 
selection. Evolution 50, 1651-1660 (1996). 

32. Schneider, C.A., Rasband, W. S. & Eliceiri, K. W. NIH Image to ImageJ: 25 years of 
image analysis. Nature Methods 9, 671-675 (2012). 

33. SAS Institute. SAS/STAT 9.3 User's Guide. (SAS Institute, 2011). 

34. Endler, J. A. A predator’s view of animal color patterns. Evol. Biol. 11, 319-364 
(1978). 
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Extended Data Figure 1 | Population means for effects of rarity and morph 
on reproductive success. Centre values are marginal means from generalized 
linear mixed models; bars indicate s.e.m. adjusted for model parameters. 
*P<0.01 and **P < 0.05, respectively. a, Mates assigned to common (white 
bars) and rare (dark grey bars) males in all populations (All, n = 124 common, 
42 rare) and by population (M, Mausica, n = 43 common, 17 rare; Q1, Quare 1, 
n= 53 common, 19 rare; Q7, Quare 7, n = 28 common, 6 rare). b, Offspring for 


common and rare males. c, Mates assigned to males with uncoloured (white 
bars) or coloured (dark grey bars) tails for all populations (All, n = 79 
uncoloured, 87 coloured) and by population (M, Mausica, n = 26 uncoloured, 
34 coloured; Q1, Quare 1, n = 35 uncoloured, 37 coloured; Q7, Quare 7, n = 18 
uncoloured, 16 coloured). d, Offspring for males with uncoloured and coloured 
tails. 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Table 1 | Effect of experimental factors and quant- Extended Data Table 2 | Effect of experimental factors and quant- 
itative traits on reproductive success in first-brood offspring itative traits on reproductive success 


P Mates F ea P 


Mates DF D 

Population 2 3.82 0.148 Population 2 2.89 0.269 
Rarity 1 9.09 0.003 Ra 1 450 0.036 
Population*rarity 2 425 0.120 Population*rarity 2 468 0.101 
Morph 1 1.75 0.185 Morph a 0.60 0.440 
Population*morph 2 043 0.805 Population*morph 2 2.83 0.247 
Body size . 1 0.01 0.919 Body size 1 0.13 «0.721 
Population*size 2 3.58 0.167 Population*size 2 2.86 0.244 
Orange area 1 0.16 0.689 Orange area 1 0.00 0.986 
Population*orange area 2 6.15 0.046 Population*orange area 2 6.92 0.034 
Number orange spots 1 1.24 0.266 Number orange spots 1 0.62 0.434 
Population*orange spots 2 0.91 0.634 Population*orange spots 2 0.65 0.722 
Number black spots 1 8131 (0.251 Number black spots 1 258 0.111 
Population*biack spots 2 531 0.070 Population*black spots 2 3.38 0.189 
Offspring Offspring 0 
Population 2 3.10 0.213 Population 2 2.39 0.332 
Rarity 1 6.69 0.010 Rarity 1 2.94 0.089 
Population*rarity 2 1.57 0.457 Population*rarity 2 3.03 0.223 
Morph 1 1.21 0.271 Morph 1 0.55 0.462 
Population*morph 2 0.78 0.677 Population*morph 2 3.26 0.200 
Body size 1 0.01 0.908 Body size 1 0.16 0.689 
Population*size 2 2.54 0.281 Population*size 2 2.00 0.370 
Orange area 1 0.87 0.352 Orange area 4 0.27 0.602 
Population*orange area 2 6.52 0.035 Population*orange area 2 6.37 0.045 
Number orange spots 1 2.75 0.098 Number orange spots 4 1.43 0.235 
Population*orange spots 2 1.36 0.506 Population*orange spots 2 1.23 0.542 
Number black spots 1 1.18 0.278 Number black spots 1 2.31 0.131 
Population“*black spots _2 3.18 0.204 Population*black spots 2 2.37__—0.309 


Significant effects are shown in bold. First-brood offspring, paternity confidence >85%. Significant effects are shown in bold. 
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Extended Data Table 3 | Population-specific estimates of the asso- 
ciation between area of orange body colour and reproductive success 


Mates Estimate SE DF t P 
Q7 18 10 128 1.86 0.066 
Qi -1,.3 08 128 -1.70 0.095 
M 0.1 10 128 0.14 0.893 
Q7 25 12 128 2.13 0.035 
Q1 “1.1 08 128 -140 0.164 
M 0.2 1.1 128 0.24 0.813 


Extended Data Table 5 | Parameter estimates for effects of morph 
(coloured — uncoloured) and the area of orange body colour, on num- 
ber of assigned mates and assigned offspring 


First-brood offspring and paternity confidence >95%. Estimates derived from the generalized linear 
mixed model. 


Mates Estimate SE DF t P 
Morph 1.5 08 128 -2.05 0.043 
Q7 3.4 1.3 128 260 0.010 
Q1 0.7 0.7 128 -1,01 0.310 
M 0.5 12 128 041 0.682 
Morph 2.0 09 128 -2.17 0.030 
Q7 4.0 14 128 281 0.006 
Q1 -1.2 098 128 -1.40 0.165 
M 0.3 1.3 128 022 0.826 


Second-brood offspring, paternity confidence >95%. 
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Extended Data Table 4 | Effect of experimental factors and quant- 
itative traits on reproductive success 


Mates DF P 
Population 2 1.24 0.539 
Rarity 1 0.90 0.344 
Population*rarity 2 1.38 0.503 
Morph 1 7.20 0.007 
Population*morph 2 1.34 0.511 
Body size 1 1.04 0.308 
Population*size 2 1.74 0.418 
Orange area 1 2.97 0.085 
“orange area 2 7.63 0.022 
Number orange spots 1 4.83 0.028 
Population*orange spots 2 1.44 0.488 
Number black spots 1 0.00 0.974 
Population*black spots 2 3.68 0.159 


Population 2 0.00 0.999 
Rarity 1 0.11 0.743 
Population*rarity 2 1.00 0.606 
Morph 1 7.37 0.007 
Population*morph 2 2.12 0.347 
Body size 1 1.38 0.241 
*size 2 0.08 0.960 

Orange area 1 2.06 0.152 
Population*orange area 2 9.76 0.008 
Number orange spots 1 3.53 0.063 
spots 2 3.96 0.142 

Number black spots 1 0.22 0.643 


: 


Second-brood offspring, paternity confidence 95%. Significant effects are shown in bold. 
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Genetic identification of a neural circuit that 


suppresses appetite 


Matthew E. Carter>*4, Marta E. Soden®*, Larry S. Zweifel** & Richard D. Palmiter!? 


Appetite suppression occurs after a meal and in conditions when it 
is unfavourable to eat, such as during illness or exposure to toxins. 
A brain region proposed to play a role in appetite suppression is the 
parabrachial nucleus’ *, a heterogeneous population of neurons 
surrounding the superior cerebellar peduncle in the brainstem. The 
parabrachial nucleus is thought to mediate the suppression of appe- 
tite induced by the anorectic hormones amylin and cholecystokinin’, 
as well as by lithium chloride and lipopolysaccharide, compounds that 
mimic the effects of toxic foods and bacterial infections, respectively*°. 
Hyperactivity of the parabrachial nucleus is also thought to cause 
starvation after ablation of orexigenic agouti-related peptide neurons 
in adult mice’’. However, the identities of neurons in the parabra- 
chial nucleus that regulate feeding are unknown, as are the func- 
tionally relevant downstream projections. Here we identify calcitonin 
gene-related peptide-expressing neurons in the outer external lateral 
subdivision of the parabrachial nucleus that project to the latero- 
capsular division of the central nucleus of the amygdala as forming a 
functionally important circuit for suppressing appetite. Using gen- 
etically encoded anatomical, optogenetic® and pharmacogenetic’ 
tools, we demonstrate that activation of these neurons projecting 
to the central nucleus of the amygdala suppresses appetite. In con- 
trast, inhibition of these neurons increases food intake in circum- 
stances when mice do not normally eat and prevents starvation in 
adult mice whose agouti-related peptide neurons are ablated. Taken 
together, our data demonstrate that this neural circuit from the 
parabrachial nucleus to the central nucleus of the amygdala med- 
iates appetite suppression in conditions when it is unfavourable to 
eat. This neural circuit may provide targets for therapeutic inter- 
vention to overcome or promote appetite. 

The parabrachial nucleus (PBN) contains subpopulations of neu- 
rons that regulate taste’®"', sodium intake'*”’, respiration”, pain’>"®, 
thermosensation’”"* and appetite suppression’ >”. To identify a specific 
genetic marker for PBN neurons that suppress appetite, we analysed 
expression of Fos, a surrogate marker of neuronal excitation, after 
genetic ablation of agouti-related peptide (AgRP) neurons or injection 
of lithium chloride (LiCl). AgRP neurons were ablated in mice expres- 
sing the human diphtheria toxin receptor (DTR) specifically in AgRP 
neurons (Agrp?™ mice)!’. Both AgRP neuron ablation (two diph- 
theria toxin injections at 50 .gkg ', intramuscular) and LiCl injection 
(84mgkg ', intraperitoneal) induced Fos expression in the outer 
external lateral subdivision of the PBN (PBelo; Supplementary Fig. 1). 
To identify a potential genetic marker for these neurons, we consulted 
the Allen Brain Explorer (http://mouse.brain-map.org)”° and searched 
for genes enriched in the PBelo. The top candidate was Calca, the gene 
that encodes calcitonin gene-related peptide (CGRP) and calcitonin by 
alternative splicing*'. Indeed, Fos expression in the PBN after ablation 
of AgRP neurons strongly overlapped with immunohistochemical 
detection of CGRP (Supplementary Fig. 2), similar to previous reports 
of coincident expression of Fos and CGRP after injection of LiCl or 
lipopolysaccharide (LPS)*°”?. 


To control gene expression in these neurons, we generated a genetic 
knock-in mouse expressing Cre recombinase at the Calca locus (Sup- 
plementary Fig. 3). When these mice were crossed with Cre-dependent 
TdTomato reporter mice, ubiquitous red fluorescence was detected 
throughout the brain, probably because of transient Cre expression 
during development. However, injection of a Cre-dependent adeno- 
associated virus (AAV) carrying a mCherry reporter directly into the 
PBN region of adult Calca“”* mice (Fig. 1a) resulted in specific expression 
of mCherry in CGRP-positive neurons in the PBelo (Fig. 1b and Sup- 
plementary Figs 4 and 5). 

To map activity in PBelo CGRP neurons, we compared virally targeted 
mCherry fluorescence with Fos following an array of environmental 
conditions that induced appetite suppression (see Supplementary Fig. 6 
for specific conditions used). In the lateral PBN, more than 80% of Fos 
expression co-localized with CGRP neurons after AgRP neuron abla- 
tion, intraperitoneal injection of LiCl, injection of LPS (Fig. 1c—h) or 
injection of the satiety hormones amylin or cholecystokinin (CCK; 
Supplementary Fig. 7a—d). In contrast, few Fos-positive neurons were 
observed in animals injected with saline (Fig. 1i, j), fasted for 24h or 
after aversive tail pinching (Supplementary Fig. 7e-h). The percentage 
of CGRP neurons co-expressing Fos significantly correlated with the 
reduction in food intake relative to baseline conditions (Fig. 1k, ]; see 
figure legends for P values and Supplementary Information for detailed 
statistical analyses). These results indicate that PBelo CGRP neurons 
are active during conditions in which appetite is suppressed but not in 
response to general adverse conditions. 

To determine whether transient stimulation of PBelo CGRP neurons is 
sufficient to reduce food intake, we unilaterally injected AAV carrying a 
Cre-dependent channelrhodopsin-2 transgene (AAV1 DIO ChR2- 
mCherry)* into the PBN of Calca*”* mice (Fig. 2a). Photostimulation 
reliably induced action potentials in mCherry-positive neurons in acute 
brainstem slices at several frequencies (20-40 Hz; Supplementary Fig. 8a) 
and in vivo photostimulation at 30 Hz was sufficient to induce expres- 
sion of Fos (Supplementary Fig. 8b, c). Stimulation of CGRP neurons 
in vivo for 5 min at 30 or 40 Hz (10-ms pulses) significantly and reversibly 
suppressed food intake during both baseline conditions and after a 
24-h fast (Fig. 2b, c and Supplementary Fig. 8d), demonstrating that 
activating these neurons is sufficient to suppress food intake. 
Suppression of feeding after photostimulation was rapid (within 
5-10 s; Supplementary Video 1) and reversible (mice typically resumed 
feeding in 5-10 min after photostimulation ceased). Stimulation at 
these frequencies did not impair movement or cause overt signs of 
distress (Supplementary Video 1). 

To determine the effects of longer-term stimulation of PBelo CGRP 
neurons, we unilaterally transduced these neurons with AAV carrying 
a Cre-dependent hM,;Dq-mCherry”” transgene (Fig. 2d). Stimulation 
of hM3Dq with clozapine-N-oxide (CNO, 1mgkg ') induced Fos 
expression (Supplementary Fig. 9) and suppressed food intake both 
during baseline conditions and after a 24-h fast (Fig. 2e, f). Chronic 
stimulation (once every 12 h for 4 d) resulted in a pronounced reduction 
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Figure 1 | Co-localization of PBelo CGRP neurons with Fos following 
conditions that reduce food intake. a, AAV carrying a Cre-dependent 
mCherry reporter injected into the PBN. Grey and black triangles represent 
loxP and lox2722 sites, respectively. b, mCherry expression in the PBelo. scp, 
superior cerebellar peduncle; scale bar, 500 um. c-j, Representative histological 
examples and quantification of coincidence of mCherry and Fos expression 
(n = 4 animals per condition). k, Degree to which various conditions reduce 
food intake (n = 4 animals per condition). 1, Appetite suppression correlates 
with the percentage of PBelo CGRP neurons expressing Fos. Dashed lines 
represent 95% confidence intervals. See Supplementary Information for 
statistical analysis. 


in food intake and body weight (Fig. 2g, h), demonstrating that long- 
term activation of these neurons is sufficient to cause starvation. 

To determine the effects of inhibiting PBelo CGRP neurons, we bilat- 
erally transduced these neurons with AAV carrying a Cre-dependent 
hM,Di-mCherry’” transgene (Fig. 3a). In acute brainstem slices, bath 
infusion of CNO reversibly reduced the firing frequency of hM,Di- 
expressing neurons to 24.98 + 8.96% of baseline (Supplementary 
Fig. 10a, b). There was no change in baseline food intake or body weight 
after intraperitoneal injection of CNO in either acute or chronic (once 
every 12h for 4 d) conditions (Supplementary Fig. 11). However, inhi- 
bition of CGRP neurons with CNO decreased the suppression of appe- 
tite observed after injection of LiCl or LPS (Fig. 3b). Consistent with 
this observation, inhibition by hM,Di also blocked the increase of 
Fos expression in the PBelo under these conditions (Supplementary 
Fig. 10c-f). Additionally, inhibition of CGRP neurons with CNO ame- 
liorated appetite suppression after injection of amylin and CCK, 
although not to statistical significance (Fig. 3b). Because genetic abla- 
tion of AgRP neurons induces Fos expression in PBelo CGRP neurons 
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Figure 2 | Stimulation of PBelo CGRP neurons reduces food intake and 
causes starvation. a, Placement of fibre optic implant in the PBN in a 
Calca"’”* animal injected with AAV DIO ChR2-mCherry. 

b, c, Photostimulation of CGRP neurons reversibly inhibits food intake in both 
baseline (b) and fasted (c) conditions (n = 8 animals per group); experiment 
replicated at least 20 times per animal in three groups of animals. d, Top, 
diagram showing AAV DIO hM3;Dq-mCherry transgene unilaterally injected 
into the PBN; bottom, timeline of experiments in e, f. e, f, Pharmacogenetic 
stimulation of CGRP neurons inhibits food intake in both baseline (e) and 
fasted (f) conditions (n = 6 animals per group); experiment replicated at least 
20 times per animal in three groups of animals. g, h, Chronic administration of 
CNO (every 12h for 4d) suppresses food intake (g) and reduces body weight 
(h) (n = 6 animals per group; experiment replicated in three groups of 
animals). *P < 0.05, **P < 0.01, ***P < 0.001; see Supplementary 
Information for statistical analyses. 


and leads to starvation’, we considered that bilateral inhibition of CGRP 
neurons would prevent starvation in these animals. To test this hypo- 
thesis, we bred Agrp?™* mice with Calca“’* mice and bilaterally 
injected Agrp?™’* /Calca“’* offspring with AAV virus carrying Cre- 
dependent hM,Di-mCherry. Indeed, chronic inhibition (injection of 
CNO every 12h for 8d) ameliorated the anorexia and prevented star- 
vation after AgRP neuron ablation (Fig. 3c, d). Taken together, these 
results demonstrate that inhibition of PBelo CGRP neurons increases 
food intake under conditions that normally suppress appetite. 

To examine the relevant efferent projections of PBelo CGRP neurons, 
we simultaneously injected two AAV vectors carrying either Cre- 
dependent mCherry or Cre-dependent synaptophysin-green fluor- 
escent protein (Syn-GFP) transgenes into the PBN of Calca“”* mice. 
We observed dense expression of mCherry- and GFP-positive fibres in 
the laterocapsular division of the central nucleus of the amygdala 
(CeAlc; Fig. 4a). This expression overlapped with immunohistochem- 
ical detection of CGRP in fibre terminals in the CeAlc (Fig. 4a) and 
is consistent with previous reports describing projections from the 
PBN to the central amygdala’®**’. To a lesser degree, we also observed 
expression of mCherry and Syn-GFP in the bed nucleus of the stria 
terminalis (BNST; Supplementary Fig. 12). We observed sparse expres- 
sion of mCherry and Syn-GFP in the lateral hypothalamus, medial 
thalamus and in the dorsal PBN; expression was notably absent from 
previously described satiety centres, such as the arcuate nucleus” or 
paraventricular hypothalamus”. To confirm the specificity of projec- 
tions from the PBelo to the CeAlc, we injected green fluorescent retro- 
beads into the CeAlc and AAV carrying Cre-dependent mCherry in 
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Figure 3 | Inhibition of PBelo CGRP neurons increases food intake during 
conditions that suppress appetite. a, AAV DIO hM,Di-mCherry transgene 
bilaterally injected into the PBN. b, Pharmacogenetic inhibition of CGRP 
neurons increases food intake after administration of anorexigenic compounds 
(n = 6 animals per group; experiment replicated at least five times per animal in 
three groups of animals). c, d, Chronic administration of CNO (every 12h for 
8 d) increases food intake (c) and prevents starvation (d) in Agrp?™’* mice 
after two injections of diphtheria toxin. The mCherry animals were killed on 
day 8 owing to extreme weight loss (1 =6-9 animals per condition; experiment 
replicated in three groups of animals). *P < 0.05, **P < 0.01, ***P < 0.001; see 
Supplementary Information for statistical analyses. 


the PBelo of Calca“”* mice. mCherry was expressed in more than 
95% of retrogradely labelled green fluorescent neurons (Supplemen- 
tary Fig. 13). 

To demonstrate functional connectivity between the PBelo and the 
CeAlc, we transduced PBelo CGRP neurons with Cre-dependent ChR2- 
mCherry. In vivo photostimulation of either the PBelo or downstream 
projections in the CeAlc resulted in an increase in Fos expression in the 
CeAlc (Supplementary Fig. 14a-f). In acute brain slices, optical stimu- 
lation of ChR2-mCherry-positive fibres in the CeAlc resulted in excitatory 
postsynaptic currents (EPSCs; Fig. 4b and Supplementary Fig. 14g, h) 
and an increase in firing rate (Supplementary Fig. 14i) in CeAlc neu- 
rons (11 out of 25 cells showed an optically evoked response). The 
EPSCs and the increase in firing rate were blocked by bath application 
of the glutamate receptor antagonists 6-cyano-7-nitroquinoxaline-2, 
3-dione (CNQX; 10 uM) and p(-)-2-amino-5-phosphonovaleric acid 
(AP5; 50 1M), indicating that PBelo CGRP neurons form an excitatory 
synaptic connection with neurons of the CeAlc (Fig. 4b and Supplemen- 
tary Fig. 14g-i). To determine the effect of stimulating PBelo-to-CeAlc 
projections on food intake, we transduced PBelo CGRP neurons with 
ChR2-mCherry and implanted fibre optic cannulae above the CeAlc 
(Fig. 4c). Photostimulation of projections into the CeAlc for 5 min at 
20-40 Hz significantly and reversibly suppressed food intake (Fig. 4d 
and Supplementary Fig. 15). In contrast, stimulation of PBelo projec- 
tions to the BNST for 5 min had no significant effect on food intake 
(Supplementary Fig. 16). Taken together, these results indicate that 
direct projections from PBelo CGRP neurons to the CeAlc are suf- 
ficient to reduce food intake. 

To determine the necessity of the PBelo-to-CeAlc projection in medi- 
ating appetite suppression, we bilaterally injected canine adenovirus 
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Figure 4 | Efferent projections from PBelo CGRP neurons to the CeAlc 
mediate appetite suppression. a, Left and middle, mCherry-expressing fibres 
from the PBelo to the CeAlc; scale bar, 500 um. Right, PBelo fibres expressing 
mCherry, synaptophysin-GFP and CGRP. b, EPSC from a CeAlc neuron upon 
photostimulation of PBelo-to-CeAlc fibres before (black) and after (red) bath 
application of CNQX and APS. c, AAV DIO ChR2-mCherry with fibre optic 
implant above the CeAlc. d, Photostimulation of PBelo fibres in the CeAlc 
reversibly inhibits food intake (m = 8 animals for each group; experiment 
replicated at least 20 times per animal in three groups of animals). e, Injection of 
CAV2 Cre into the CeAlc and AAV DIO hM,Di-mCherry into the PBN. 

f, Inhibition of retrogradely targeted PBelo neurons increases food intake 

(n = 6 animals for each group); experiment replicated at least five times per 
animal in three groups of animals. *P < 0.05, **P < 0.01; see Supplementary 
Information for statistical analyses. 


(CAV2) carrying Cre recombinase into the CeAlc. CAV2 is capable of 
efficient retrograde transport” and therefore will express Cre in upstream 
PBelo neurons. In the same animals, we also bilaterally injected AAV 
carrying a Cre-dependent hM,Di transgene into the PBN (Fig. 4e). 
Inhibition of PBN neurons with CNO decreased the suppression of 
appetite observed after injection of LiCl or LPS (Fig. 4f), demonstrating 
that activity in PBelo neurons projecting to the CeAlc is necessary 
for the normal suppression of appetite observed after injection of 
LiCl or LPS. 

Taken together, these results demonstrate a neural circuit from CGRP- 
expressing neurons in the PBelo to the CeAlc that mediates appetite 
suppression. One of our observations was that inhibition of PBN neu- 
rons increased food intake when mice did not normally eat (Figs 3b-d 
and 4f), but did not statistically increase food intake in baseline con- 
ditions (Supplementary Fig. 11) or after injection of amylin or CCK 
(Fig. 3b). These findings are consistent with those of Atasoy et al.”’, 
who did not find any effect on food intake after stimulation of inhib- 
itory projections from AgRP neurons to the PBN. Perhaps inhibitory 
projections from AgRP neurons do not stimulate food intake in base- 
line conditions but do decrease the suppression of appetite when the 
PBN is most active. Thus, ablation of AgRP neurons causes starvation 
in adult mice because of disinhibition in the PBelo’. Additionally, 
PBelo CGRP neurons may not mediate the ordinary, routine satiety 
experienced after a meal, but may mediate more severe forms of satiety 
experienced during severe overfeeding (gastric distention), illness or 
other conditions in which it is unhealthy to eat such as dehydration or 
vertigo. 

Although it is well-established that several hypothalamic and brain- 
stem nuclei coordinate appetite and satiety*°”, this study demon- 
strates the involvement of downstream circuitry that may mediate 
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the ‘unpleasant feeling’ or discomfort that results from adverse condi- 
tions during which it is unfavourable to eat. Indeed, the CeAlc is known 
to process polymodal information about the internal and external 
bodily environment including adverse visceral stimuli*®. Adding to 
previous results’”, we propose that PBelo CGRP neurons integrate 
visceral and energy balance information and communicate with the 
CeAlc to mediate extreme satiety and malaise (Supplementary Fig. 17). 


METHODS SUMMARY 
We generated Calca”* mice and Agrp?™* mice and backcrossed them onto a 
C57B1/6 background. AAV 1 vectors were stereotaxically injected into the PBN; in 
some experiments, CAV2 Cre was also stereotaxically injected into the CeAlc. For 
photostimulation experiments, mice were implanted with fibre optic cannulae 
above the PBN, CeAlc or BNST; blue light at 473 nm was delivered in 10-ms pulses 
at 20 mW intensity through a 1.5-m fibre optic cable. For pharmacogenetic manip- 
ulation, CNO was injected at 1 mgkg ', intraperitoneally. To ablate AgRP neu- 
rons, mice carrying the Agrp?™’* allele were injected twice with diphtheria toxin 
(50 pig kg” a intramuscularly, 2 days apart). LiCl (84 mgkg~ '. 0.2 M, 10 ml kg ay 
LPS from Salmonella typhimurium (50 wg kg~'), amylin (10 pgkg~') and CCK 
(10 pg kg” *) were all injected intraperitoneally as described in Supplementary Fig. 6. 
Food intake was monitored using lickometer cages supplied with water and liquid 
diet available ad libitum. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Mice. All experiments were approved by the University of Washington Institu- 
tional Animal Care and Use Committee and were performed in accordance with 
the guidelines described in the US National Institutes of Health Guide for the Care 
and Use of Laboratory Animals. We used exclusively heterozygous male CalcaC"”* 
and Agrp?'®”* mice backcrossed onto a C57BI/6 background, aged 7-9 weeks at 
the start of experimental procedures and no more than 18 weeks at the end of 
experimental procedures. Before stereotaxic surgery, mice were group housed and 
maintained with rodent diet (Picolab, number 5053) and water available ad libitum 
with a 12-h light:dark cycle at 22°C. After surgical procedures, mice were indi- 
vidually housed and maintained with a liquid diet (Vanilla Ensure, Abbot Labo- 
ratories) and water available ad libitum. We performed experiments on three or 
four experimental animals (for example, animals transduced with ChR2, hM3Dq, 
hM,Di) and three or four control animals (for example, animals only transduced 
with mCherry) at the same time to avoid differences in results between experi- 
mental sessions. Animals were randomly assigned to either the experimental or 
control groups in each litter. 

Generation of Calca“” mice. A 14-kb BstB1-Pacl fragment was isolated from a 
C57Bl/6 mouse BAC clone. A unique Sall site was introduced at the initiation 
codon of Calca in exon 2 by PCR. Next, a 7-kb Spel-Sall1 5’ fragment was cloned 
into a targeting vector containing a frt-flanked PkgNeo gene for positive selection 
and Pgk-DTA and HSV-TK genes for negative selection. This was followed by 
insertion of a 3’ 7-kb Sall-Pacl fragment. The gene encoding the Cre-GFP fusion 
protein with an amino (N)-terminal myc-tag and nuclear localization signal was 
cloned into the unique Xhol site adjacent to frt-PgkNeo. The targeting construct 
was linearized with Ascl and electroporated into G4 hybrid (C57Bl/6 X Sv129) 
embryonic stem cells. Eighteen correctly targeted clones were identified out of 96 
by Southern blot of EcoRV-digested DNA using a unique probe outside the tar- 
geting vector. Several of these clones gave good chimaeras when injected into 
C57B1/6 hosts. One chimaera was bred with FLPer (Rosa26-Flip recombinase) 
to remove the frt-Pgk-Neo gene. 

Virus production. Cre-dependent pAAV mCherry and ChR2-mCherry (driven 
by the Efl« promoter) DNA plasmids were provided by K. Deisseroth, and Cre- 
dependent pAAV hM;Dq-mCherry and hM,Di-mCherry (driven by the human 
synapsin promoter) DNA plasmids were provided by B. Roth. pAAV synapto- 
physin-GFP was generated by fusing the 3’ end of the mouse Synaptophysin 
coding region with the 5’ end of the GFP coding region. This sequence was then 
exchanged with mCherry in the pAAV mCherry plasmid to make the transgene 
Cre-dependent. Recombination-deficient AAV vectors were prepared in human 
embryonic kidney (HEK293T) cells with AAV1 coat serotype, purified by sucrose 
and CsCl gradient centrifugation steps, and re-suspended in 1 Hanks Balanced 
Saline Solution (HBSS) at a titre of approximately 2 x 10° viral genomes per 
microlitre. CAV2 Cre was prepared in dog kidney (DK/E1-1) cells, purified by 
sucrose and CsCl gradient centrifugation steps, and re-suspended in 1X HBSS ata 
titre of approximately 2.5 X 10° viral genomes per microlitre as described previously”. 
Viral aliquots were stored at —80 °C before stereotaxic injection. 

Stereotaxic surgery. At the start of surgical procedures, mice were anaesthetized 
with isofluorane and placed on a stereotaxic frame (David Kopf Instruments). 
Stereotaxic coordinates for the anterior—posterior plane were normalized using a 
correction factor (F = (Bregma — Lamda distance)/4.21) on the basis of the coor- 
dinates of Paxinos and Franklin’. Virus was injected unilaterally (on the left side) 
or bilaterally as described in the text either in the PBN (antero-posterior (AP), 
—4.9 mm; medio-lateral (ML), 1.4 mm; dorso-ventral (DV), 3.8 mm) or the CeAlc 
(AP, —1.2 mm; ML, 2.6mm; DV, 5.4mm) at a rate of 0.2 ul min ! for 2.5 min 
(0.5 pl total volume). Note that the viral injection coordinates target the most 
anterior aspect of the PBN but the virus diffuses posteriorly to hit all PBN sub- 
nuclei; this injection site improves the accuracy of injecting into the lateral PBN 
between the superior cerebellar peduncle (scp) and lateral wall of the pons. Also 
note that the presence of the scp fortuitously limits the spread of virus from the 
PBN region, thus preventing unintended transduction of other nearby Calca- 
expressing brain regions. 

After viral injection, mice used for optogenetic experiments also received uni- 
lateral surgical implantation of a Mono Fibreoptic Cannula (Doric Lenses), either 
above the PBN (AP, —5.2 mm; ML, 1.6 mm; DV, 3.0 mm), CeAlc (AP, —1.2 mm; 
ML, 2.6 mm; DV, 5.4mm) or BNST (AP, +0.14mm; ML, 1.0 mm; DV, 4.0 mm). 
Cannulae were affixed to the skull with C&B Metabond (Parkell) and dental 
acrylic. 

Slice electrophysiology. Coronal brain slices (250 jm) were prepared in an ice 
slush solution containing (in mM) 250 sucrose, 3 KCl, 2 MgSOu, 1.2 NaH2PO,, 
10 p-glucose, 25 NaHCO; and 0.1 CaCl). Slices recovered for 1h at 34°C in 
artificial cerebral spinal fluid (ACSF) continually bubbled with O2/CO, and con- 
taining (in mM) 126 NaCl, 2.5 KCI, 1.2 NaH ,PO,, 1.2 MgCl, 11 p-glucose, 18 
NaHCO; and 2 CaCl;. Whole-cell patch-clamp recordings were made using an 
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Axopatch 700B amplifier (Molecular Devices) with filtering at 1 kHz using 4-6 MQ 
electrodes filled with an internal solution containing (in mM) 120 CsMeSOs, 20 
HEPES, 0.4 EGTA, 2.8 NaCl, 5 Mg-ATP, 0.5 Na-GTP, pH 7.2-7.4, 280 mOsm (for 
CeAlc single EPSC recordings) or 135 KMeSO3, 10 KCI, 10 HEPES, 0.1 EGTA, 2.8 
NaCl, 5 Mg-ATP, 0.5 Na-GTP, pH 7.2-7.4, 280 mOsm (for PBN recordings and 
CeAlc high-frequency stimulus recordings). ACSF at 32 °C was continually per- 
fused over slices at a rate of approximately 2 ml min’ ' during recording. 

For photostimulation experiments in the CeAlc, neurons surrounded by mCherry- 
positive fibres were selected for recording. For light-evoked EPSCs, neurons were 
held in voltage clamp at —70 mV, and EPSCs were stimulated by 10-ms pulses of 
blue laser light (single pulse or 300 pulses at 30 Hz) using the same fibre optic as for 
the in vivo experiments, placed in the bath above the slice. Traces are averages of 
15-20 sweeps for single EPSCs or averages of three sweeps for 30 Hz trains, 
collected before and 5 min after bath application of CNQX (104M) and APS 
(50 1M). For CeAlc current clamp recordings, most CeAlc neurons had resting 
membrane potentials between —65 and —80 mV and did not fire action potentials 
spontaneously unless constant current was injected through the patch pipette to 
bring the cell closer to threshold. Trains of stimuli at 30 Hz were delivered as above. 

For PBN ChR2 recordings, neurons identified by fluorescence were recorded in 
current-clamp mode and trains of light pulses (1 ms) were delivered at the indi- 
cated frequencies for durations of 2 s. For pharmacogenetic experiments, neurons 
expressing hM,Di were identified by fluorescence. Cells that did not show spon- 
taneous firing were excluded. Neurons were recorded in current clamp mode in 
ACSF for a 5-min baseline period, followed by bath application of 3 4M CNO for 
3 min, then washout with ACSF. Baseline firing rate was calculated from a 2-min 
window immediately before CNO application. CNO and washout firing rates were 
calculated from 2-min windows surrounding the maximum CNO effect and the 
maximum recovery period, respectively. 

Food intake monitoring. For feeding assays, mice were individually housed in 
lickometer cages (Columbus Instruments) supplied with water and liquid diet 
(Vanilla Ensure, Abbot Laboratories) available ad libitum. Food and water ports 
were changed daily at the start of the dark cycle. The mice were allowed to 
acclimate to lickometer cages for 5 days and then baseline food intake was mea- 
sured for an extra 5 days before experimental procedures. Acute food intake mea- 
surements (Figs 2b, c, e, f, 3b and 4d, f) occurred at the onset of the active period 
(lights off). Long-term measurements (Fig. 2g, h and 3c, d) occurred over a 24-h 
period with total food-intake and body weight measured approximately 6 h before 
the onset of the active period. Measurements were performed by an investigator 
(M.E.C.) with knowledge of the identity of the experimental versus control groups 
(that is, without blinding). 

Photostimulation. After a 14-day recovery period following surgery, mice were 
individually housed in lickometer cages with open cage tops. Fibre optic cables 
(1.5m long, 200 1m diameter; Doric Lenses) coated with opaque heat-shrink 
tubing were firmly attached to the implanted fibre optic cannulae with zirconia 
sleeves (Doric Lenses). Mice were allowed at least 5 days to acclimate before 
experimental sessions. During photostimulation experiments, light pulse trains 
(10-ms pulses of various frequency; see text) were programmed using a waveform 
generator (Agilent Technologies, number 33220A) that provided input to a blue 
light laser (473 nm; LaserGlow). We adjusted the light power of the laser such that 
the light power exiting the fibre optic cable was 20 mW (160 mW mm ”); using an 
online light transmission calculator for brain tissue** (http://www.stanford.edu/ 
group/dlab/cgi-bin/graph/chart.php) we estimated the light power at the PBelo to 
be 36.2 mW mm *. Note that this is probably a high estimation because some light 
was probably lost at the interface between the fibre optic cable and the implanted 
fibre optic cannula. After the completion of photostimulation experiments, mice 
were perfused and the approximate locations of fibre tips were identified based on 
the coordinates of Paxinos and Franklin’. 

Pharmacological injections. Pharmacological compounds were prepared in sterile 
0.9% saline and stored at —20 °C before use. Diphtheria toxin was injected intra- 
muscularly and all other compounds were administered intraperitoneally as described 
in Supplementary Fig. 6. Compounds included amylin (10 ug kg _'; Bachem, num- 
ber H-9475), CCK-8 (10 pgkg~'; Bachem, number H-2080), CNO (1 mgkg ’; 
Sigma, number C0832), diphtheria toxin (50 pg kg; List Biologicals, number 
150), LiCl (84mg kg; 0.20 M at 10 mike; Fisher, number L121) and LPS, 
S. typhimurium (50 ug kg; Calbiochem, number 437650). 

Histology. Mice were anaesthetized with buprenorphine and perfused transcar- 
dially with 1X PBS, pH 7.4, followed by 4% paraformaldehyde in PBS. The brains 
were extracted, allowed to postfix overnight in the same fixative at 4 °C and cryo- 
protected in 30% sucrose dissolved in 1X PBS for an extra 24h at 4 °C. Each brain 
was sectioned at 30 tm on a cryostat (Leica Microsystems) and collected in cold 
1X PBS. 

For immunohistochemistry experiments, sections were washed three times in 
PBS with 0.2% Triton X-100 (PBST) for 10 min at room temperature. Sections 
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were then incubated in a blocking solution composed of PBST with 3% normal 
donkey serum (Jackson ImmunoResearch, number 017-000-121) for 1h. For 
primary antibody exposure, sections were incubated in rabbit anti-c-Fos (1:2000, 
Calbiochem, number PC38), rabbit anti-GFP (1:1000, Invitrogen, number A11122) 
and/or goat anti-CGRP (1:500, Abcam, number ab36001) in blocking solution at 
4°C for approximately 20h. After three 10-min washes in PBST, sections were 
incubated in Alexa Fluor 594 donkey anti-goat IgG (1:200, Jackson Immuno- 
research, number 705-858-147), Alexa Fluor 488 donkey anti-goat IgG (1:200, 
Jackson Immunoresearch, number 705-485-147), DyLight 405 donkey anti-goat 
(1:200, Jackson Immunoresearch, number 705-475-147) and/or Alexa Fluor 488 
donkey anti-rabbit IgG (1:200 Jackson Immunoresearch, number 711-545-152) in 
block solution for 1h at room temperature. Finally, sections were washed three 
times in 1X PBS. 

Sections were mounted in PBS onto SuperFrost Plus glass slides (VWR, number 
48311-703) and coverslipped with Dapi Fluoromount-G (Southern Biotech, number 
0100-20). Slides were stored in the dark at 4 °C before microscopy and image acquisition. 

Quantification of co-localization of Fos and mCherry in the PBN (Fig. 1c-j and 
Supplementary Fig. 7) was performed on adjacent sections from approximately 
Bregma —4.90 to —5.50 (exactly 21 sections per mouse). Quantification of Fos in 
the CeAlc (Supplementary Fig. 14a-f) was performed on adjacent sections from 
approximately Bregma —1.14 to — 1.26 (exactly five sections per mouse). A Fos- 
positive cell was considered located in the CeAlc if it was in the field of mCherry 
fluorescence in that particular section. An investigator (M.E.C.) blinded to the 
identity of the conditions used to induce Fos performed all quantification. 
Microscopy. Fluorescent and brightfield images were collected on either a Nikon 
upright epifluorescent microscope with a QImaging Camera (Figs 1b and 4a and 
Supplementary Figs 1 and 4) or a Zeiss LSM 510 Meta confocal microscope. 


Images were minimally processed using Photoshop CS5 (Adobe Systems) to 
enhance brightness and contrast for optimal representation of the data. Low- 
magnification brightfield images (Figs 1b and 4a and Supplementary Fig. 4) were 
montaged together to produce a single coronal section. All digital images were 
processed in the same way between experimental conditions to avoid artificial 
manipulation between different data sets. 
Statistics. We used an online power and sample size calculator to determine an 
effective sample size for statistical comparisons** (http://homepage.cs.uiowa.edu/ 
~rlenth/Power/). Assuming a standard deviation of 1.0 and a significance level of 
0.05, this calculator shows that with eight mice per group we had an 80% confid- 
ence level of achieving statistical significance between means of 1.5-fold with a 
two-tailed Student’s t-test. We excluded an animal from data analysis if flagged by 
a University of Washington veterinarian for health reasons during the experi- 
mental period or if post hoc histological analysis showed no viral transduction 
as indicated by an absence of mCherry fluorescence. 

All data were analysed using Prism 6.0 (GraphPad Software) as described in the 
text and Supplementary Statistical Analysis. Data were exported into Illustrator 
CS5 (Adobe Systems) for preparation of figures. 
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Dendritic spikes enhance stimulus selectivity in 


cortical neurons in vivo 


Spencer L. Smith!?, Ikuko T. Smith!?, Tiago Branco’? & Michael Hiiusser! 


Neuronal dendrites are electrically excitable: they can generate 
regenerative events such as dendritic spikes in response to suffi- 
ciently strong synaptic input'*. Although such events have been 
observed in many neuronal types*”, it is not well understood how 
active dendrites contribute to the tuning of neuronal output in vivo. 
Here we show that dendritic spikes increase the selectivity of neur- 
onal responses to the orientation of a visual stimulus (orientation 
tuning). We performed direct patch-clamp recordings from the den- 
drites of pyramidal neurons in the primary visual cortex of lightly 
anaesthetized and awake mice, during sensory processing. Visual 
stimulation triggered regenerative local dendritic spikes that were 
distinct from back-propagating action potentials. These events were 
orientation tuned and were suppressed by either hyperpolarization 
of membrane potential or intracellular blockade of NMDA (N-methyl- 
D-aspartate) receptors. Both of these manipulations also decreased 
the selectivity of subthreshold orientation tuning measured at the 
soma, thus linking dendritic regenerative events to somatic orienta- 
tion tuning. Together, our results suggest that dendritic spikes that 
are triggered by visual input contribute to a fundamental cortical 
computation: enhancing orientation selectivity in the visual cortex. 
Thus, dendritic excitability is an essential component of behaviou- 
rally relevant computations in neurons. 

Neuronal dendrites express voltage-dependent Ca~* and Na” chan- 
nels that confer electrical excitability, particularly the ability to support 
the active back-propagation of action potentials and the initiation of 
local dendritic spikes’. In addition, the voltage-dependent Mg”* block 
of synaptic NMDA receptors can also support nonlinear synaptic inte- 
gration and dendritic spike initiation®'®. These mechanisms of active 
synaptic integration have been probed extensively in vitro”. Dendritic 
spikes have also been observed in vivo under certain conditions® *; how- 
ever, it remains unclear whether they are involved in behaviourally rele- 
vant computations'’’. To investigate whether dendritic non-linearities 
can contribute to a well-known example of cortical computation, orien- 
tation tuning in the visual cortex'*, we made direct dendritic patch- 
clamp recordings from layer 2/3 neurons in the mouse visual cortex. 

To measure dendritic activity in vivo, whole-cell patch-clamp record- 
ings were obtained from the thin apical dendrites (diameter, 2.0 + 0.4 lum, 
mean + s.d.; n = 12) of layer 2/3 neurons in the mouse primary visual 
cortex (Fig. 1a) in lightly anaesthetized and awake mice. The patch- 
clamp pipette was used to fill cells with a fluorescently labelled Ca** 
dye (100 1M Oregon Green 488 BAPTA-1), for imaging Ca** transients, 
and a red fluorescent dye (25-50 LM Alexa Fluor 594), for imaging the 
morphology of the dendritic arbor and identifying the precise location 
of the dendritic recording (Fig. 1b). Dendritic recordings exhibited the 
expected physiological features, such as a high local input resistance 
that increased with distance from the soma’*'*® (Extended Data Fig. la-c). 

In somatic recordings, visual stimulation with drifting square-wave 
gratings evoked conventional action potential activity, with the firing 
rate tuned to the orientation of the stimulus'*”” (Fig. 1c). By contrast, 
recordings from distal dendrites (>75 1m from the soma) showed 


orientation-tuned, high-frequency bursts of Na” spikes riding on a 
depolarization envelope, a finding that is consistent with the activation 
of voltage-gated Ca”* channels and synaptic NMDA receptor currents 
(Fig. 1d-g and Extended Data Fig. 1d). The properties of these spikes 
were in contrast to those of isolated spikes (single spikes separated 
by at least 50 ms from other spikes), which are presumed to be back- 
propagating action potentials (bAPs; Fig. 1d, e), although not all bAPs 
are isolated bAPs. The isolated bAPs exhibited a uniform amplitude 
and shape within a recording, and they decreased in amplitude and 
increased in width with increasing distance from the soma’* (Extended 
Data Fig. le, f). In contrast to dendritic bursts, which can contain both 
local Na* spikes and bAPs, isolated bAPs provide a read-out of somatic 
activity that can be compared with local dendritic events. Visually 
evoked spike bursts that were recorded in distal dendrites were tuned 
to the orientation of the stimulus, with reliable trial-to-trial tuning 
(Fig. le, f). The preferred orientation of dendritic spike bursts did 
not differ from that of bAPs recorded in the same cell (difference in 
preferred orientation, 34.7 + 28.8°; n=9; P=0.22, paired t-test; 
Extended Data Fig. 2). 

We next sought to determine whether these dendritic events were 
local. Given that dendritic Na* spikes, a prominent feature of the den- 
dritic regenerative events we recorded, have fast kinetics, their wave- 
form is likely to be heavily attenuated by the cable-filtering properties 
of the dendritic arbor and thus not propagated efficiently to the soma 
(by contrast, the slow depolarization envelope that typically underlies 
dendritic regenerative events can spread more readily to the soma’’). 
The maximum instantaneous and mean spike rates, as well as the 
variance-to-mean ratio, were higher in distal dendritic recordings 
(>75 um; n= 9) than in proximal dendritic recordings (<50 lum; 
n= 5) or somatic recordings (Fig. 2a), suggesting that many of the 
individual spikes in the bursts observed in distal dendritic recordings 
are indeed local dendritic spikes and not bAPs"*. The spike statistics 
from somatic recordings and proximal dendritic recordings (<50 um 
from the soma; 1.7 + 0.3 um, mean diameter + s.d.; P = 0.14, two- 
sample t-test, not different from the diameter of distal dendrites 
recorded) were indistinguishable (P > 0.05, two-sample t-test), thus 
confirming that the dendritic recording configuration itself does not 
affect spike rates. These measurements were made in lightly anaesthe- 
tized mice, and similar spiking patterns were observed in dendritic 
recordings from awake mice (Fig. 2a, filled symbols, and Extended 
Data Fig. 3), demonstrating that such spiking is also present in the 
alert, behaving animal. These data indicate that, at distal dendritic 
recording sites, spiking occurs at higher rates and with a higher degree 
of ‘burstiness’ than spiking recorded at the soma, and they provide the 
first piece of evidence that these dendritic events are local. 

A second piece of evidence indicating the dendritic origin of these 
events is provided by the onset, or foot, of the spike waveform. One 
signature of propagated spikes is a sharp inflection at the foot of the 
spike, in contrast to the smooth rise observed near the site of spike 
initiation’®. Isolated spontaneous spikes exhibited a clear ‘kink at their 
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Figure 1 | Dendritic patch-clamp recordings from visual cortex pyramidal 
neurons in vivo. a, Schematic of the recording and imaging set-up for in vivo 
dendritic patch-clamp recordings with two-photon microscopy. b, A two- 
photon microscopy image of a layer 2/3 pyramidal neuron in the mouse visual 
cortex in vivo, obtained by filling with Alexa Fluor 594 and dendritic patch- 
clamp recording at 100 um from the soma (maximum intensity projection) 
(scale bar, 20 um). ¢, d, Square-wave grating visual stimuli evoked somatic 
action potentials (c) and dendritic activity (d) that exhibited reliable 
orientation-tuned burst-spiking events. e, The spikes within dendritic burst 
events were highly variable compared with the more stereotyped bAPs and 
somatically recorded action potentials. f, The frequency of dendritic burst 
events varied with the orientation of the square-wave grating visual stimuli. 
g, Individual burst events were highly variable in amplitude and kinetics 
(dendritic recording 150 tm from the soma). 
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200 ms 


onset, as expected for bAPs (Fig. 2b). Spikes within bursts, by contrast, 
generally exhibited a much slower onset, indicative of local generation 
(Fig. 2c). The membrane potential at spike initiation did not comple- 
tely predict the spike onset speed (Extended Data Fig. 4). Thus, the 
depolarization envelope during dendritic bursts, which can also reduce 
the speed of spike onset, did not by itself account for the difference in 
spike shape. Although these spikes were heterogeneous and were prob- 
ably a mixed population containing some bAPs, spikes within bursts 
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Figure 2 | Visually evoked dendritic burst events are local. a, Whole-cell 
patch-clamp recordings were performed at the soma (n = 9), proximal 
dendritic (proximal dend.) locations (<50 um from the soma; n = 5) or distal 
dendritic (distal dend.) locations (>75 jum from the soma; n = 9). The 
frequency of stimulus-evoked dendritic burst events at the soma and the 
proximal dendritic locations were similar; however, the event frequency and 
burstiness (variance-to-mean ratio of the spike count distribution, or Fano 
factor) were significantly higher at the distal dendritic locations (P values from 
the Wilcoxon rank sum test; comparing somatic and distal dendritic 
recordings). Data from recordings in awake mice (n = 6 total) exhibited the 
same trends as in lightly anaesthetized mice. Error bars indicate s.e.m. b, The 
inflection in membrane potential (V,,,) for isolated back-propagated action 
potentials recorded at distal dendritic locations (>100 tum from the soma) 
exhibited a sharp kink, consistent with propagated spikes. This is visible both in 
voltage versus time plots (left) and dV/dt (rate of V, change, where t is time) 
versus voltage phase plots (right). The insets show magnifications of the onset 
at early time points. c, By contrast, spikes in bursts exhibited a slower onset, 
consistent with local generation. d, Across the population of dendritic 
recordings, isolated spikes consistently exhibited sharper inflections at onset 
(measured as the initial slope in the dV/dt versus voltage phase plots; dashed 
lines in b and c) than did spikes in bursts. Distal dendritic recordings from 
awake mice exhibited the same trend as in lightly anaesthetized mice. 


consistently exhibited a slower onset, across the population, than did 
isolated bAPs (P = 0.0013, Wilcoxon rank sum test; n = 7 anaesthe- 
tized mice; data from awake mice exhibited the same trend; Fig. 2d), 
consistent with their identity as locally generated dendritic spikes. 
To obtain a third piece of evidence, anda more direct read-out of the 
relationship between dendritic spikes and somatic action potentials, 
we used two-photon Ca** imaging to simultaneously measure Ca** 
fluctuations at the soma during dendritic patch-clamp recordings, 
allowing us to infer somatic action potential activity during dendritic 
patch-clamp recordings. Spike bursts recorded at distal dendritic sites 
were correlated with somatic Ca~” signals (Fig. 3a). As was the case 
with the tuning of dendritic bursts and bAPs, the preferred orientation 
of dendritic bursts (Fig. 3b) was similar to the preferred orientation of 
the somatic Ca** signal (Fig. 3c), with qualitative differences in the 
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Figure 3 | Simultaneous dendritic recording and Ca?* imaging at the soma 
shows that dendritic spikes are local. a, To infer spiking activity at the soma 
during dendritic patch-clamp recordings, neurons were filled with a Ca”* 
indicator and somatic Ca** signals were imaged. During simultaneous 
dendritic voltage recordings and somatic Ca”* imaging, dendritic bursts were 
sometimes not accompanied by robust somatic Ca’ * signals (indicated with a 
heavy arrow). Arrows at the top indicate the orientation of the visual stimulus 
grating, with shaded boxes indicating the duration of stimulus presentation. 
b, c, Dendritic bursting was well tuned (b) and overlapped with the orientation 
tuning at the soma (c). (The data for Cell 1 are shown ina andb.) d, To calibrate 
the Ca** signals seen at the soma, we calculated spike-triggered averages of the 
somatic Ca~" signal for well-isolated single spikes and bouts of four to five 


tuning curves consistent with the interpretation that a single dendrite 
contributes only a portion of the input that drives a neuron to fire. To 
calibrate these measurements, we used separate somatic recordings 
combined with somatic Ca** imaging. The somatic Ca”* signal observed 
during a dendritic burst of four to five spikes was indistinguishable 
from the Ca’" signal observed during a single action potential recorded 
at the soma (P = 0.77, two-sample t-test; Fig. 3d). Furthermore, although 
the somatic Ca” signals became saturated near the same level in both 
dendritic and somatic recording configurations (fluorescence change 
over baseline AF/Fy ~ 30% for both configurations; not different 
between the two configurations, P = 0.99, two-sample t-test), this sat- 
uration level was reached with only 10 spikes recorded at the soma but 
needed more than 40 spikes in the dendrite (P< 10°, the two distri- 
butions are different, multivariate Kolmogorov-Smirnov test; n = 5; 


| Peak AF/F, (%) 


i rr i 


10% AF/F, 


0.5s 


Number of somatic APs/dendritic spikes recorded 


spikes (within 640 ms). The somatic Ca** signal amplitude was similar for 
single spikes regardless of whether the spikes were recorded at the soma or the 
dendrite. No difference in Ca’* signal amplitude was found between bouts of 
four to five spikes recorded at the dendrite and single spikes recorded at the 
soma, suggesting that some dendritic spikes were local. e, Across the population 
of events, even though the Ca” signals became saturated at about the same 
magnitude (fluorescence change over baseline AF/Fy ~ 30%; not different 
between the two configurations, P = 0.99, two-sample t-test), the somatic Ca2t 
signal as a function of the number of dendritic spikes rose much more slowly 
than that for the somatically recorded spikes (left). Data from responses with 
fewer than ten spikes are presented magnified (right). 


Fig. 3e). Thus, many of the individual spikes in bursts recorded at the 
dendrite were probably locally generated, rather than each reflecting a 
bAP, and resulted in subthreshold depolarizations at the soma. Measure- 
ments of Ca’* signals at the site of distal dendritic recordings (Extended 
Data Fig. 5a) provided further evidence that individual dendritic spikes 
were locally generated, by showing that dendritic bursts with Ca** 
signals spanning all of the visible dendritic branches (Extended Data 
Fig. 5b) contained spikes with steeper onsets than did bursts with Ca7* 
transients that were confined to the local recording site and not 
observed in adjacent branches” (n = 78 dendritic bursts; Extended 
Data Fig. 5c-f). Together, these results demonstrate how dendritic 
spikes, local Ca”* signals and bAPs interact during visual processing. 

A final piece of evidence that dendritic bursts consist of locally 
generated dendritic spikes is that visually evoked dendritic bursts were 
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Figure 4 | Hyperpolarization decreases the frequency of dendritic bursts. 
a, An example dendritic recording shows that compared to control conditions 
(left), hyperpolarization (right) decreased the mean frequency of bursts (from 
0.32 Hz to 0.14 Hz; —68%) more than the mean frequency of bAPs (0.51 Hz to 
0.36 Hz; —30%). b, Population data showing that bursts and bAPs were both 
suppressed (P = 3.0 X 10 © and P = 0.0004, respectively, t-test) and that bursts 
were suppressed to a greater degree by hyperpolarization than were bAPs 

(P = 0.005, paired t-test). Bars indicate population means and error bars 
indicate s.e.m. for the decrease in mean rates of bAPs and dendritic bursts 
during hyperpolarization compared with the control condition. Data from 
individual dendritic recordings are shown between the two bars, with lines 
connecting data points from the same recording. 


more sensitive to local dendritic hyperpolarization than were bAPs. 
When a steady hyperpolarizing current was delivered through the 
dendritic patch-clamp pipette, the rate of burst events decreased more 
than did the rate of isolated bAPs (P = 0.005, paired t-test; Fig. 4), 
indicating that the dendritic burst-generating mechanisms were more 
sensitive to the local membrane potential than were the mechanisms 
that support back-propagation of somatic action potentials. Taken 
together, these findings indicate that the spike bursts recorded at the 
dendrite during visual processing are not a pure population of bAPs 
but rather consist mainly of locally generated dendritic spikes. 

We next sought to uncover how dendritic regenerative events evoked 
by visual stimulation might influence the orientation tuning of neuronal 
output. The prolonged depolarization envelope of dendritic bursts can 
propagate to the soma and influence axonal output. If dendritic bursts, 
and the somatic depolarizations they cause, are required for normal 
synaptic integration during visual processing, then blocking them might 
disrupt an important cortical computation: orientation tuning. Layer 2/3 
neurons exhibited spiking that was robustly orientation tuned”, as well 
as subthreshold responses (Fig. 5a—d; even when the cells fired few to 
no spikes, Extended Data Fig. 6). The subthreshold tuning closely 
matched the spike-based tuning in terms of preferred orientation” 
(Pearson’s correlation coefficient R = 0.83, P< 10 °; Extended Data 
Fig. 7a) and orientation selectivity (as measured by the membrane 
potential orientation selectivity index, V,,OSI; Pearson’s R = 0.88, 
P = 0.0040; Extended Data Fig. 7b). This subthreshold tuning provides 
a way to compare synaptic integration in the control condition and 
the hyperpolarized condition where dendritic spikes are prevented 
(Fig. 4). Hyperpolarization degraded this subthreshold orientation tuning 
both in terms of the modulation amplitude (Fig. 5e, f) and the V,,OSI 
(Fig. 5g), without changing the preferred orientation (Pearson’s R = 0.88, 
P= 0.022; n = 10; Extended Data Fig. 7c, d). This effect was not accounted 
for by changes in the driving force for Cl” ions (Pearson’s R = 0.34, 
P = 0.15; Extended Data Fig. 8). These results demonstrate that voltage- 
gated mechanisms are required for normal synaptic integration and 
subthreshold orientation tuning during visual processing, which is 
consistent with the hypothesis that dendritic regenerative events enhance 
subthreshold orientation tuning. 

The synaptic NMDA receptor current, which is subject to voltage- 
dependent Mg”~ block, is a prime candidate for linking synaptic input 
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Figure 5 | Dendritic mechanisms contribute to the selectivity of 
subthreshold orientation tuning. a, b, Somatic whole-cell recording from a 
layer 2/3 pyramidal neuron, exhibiting robust visually evoked spiking (three 
sweeps overlaid; stimulus duration indicated by the grey bars) (a) and 
subthreshold responses (50-ms windows around the spikes were blanked, and 
the same results were obtained when the blanking window was decreased to 
20 ms) (b). ¢, d, The responses in a and b were orientation tuned as indicated by 
spike rasters (c) and polar plots of maximal depolarization (d). 

e-g, Hyperpolarization (hyperpol.) decreased the amplitude of the stimulus- 
evoked membrane potential modulation and decreased its tuning selectivity 
(e): it decreased V,,, modulation amplitude (f) and decreased the V,, orientation 
selectivity index (OSI) (g) across the population. h-j, To investigate the 
mechanisms involved in the dendritic spikes contributing to orientation tuning, 
whole-cell somatic patch-clamp recordings were performed with 1 1M MK- 
801 in the pipette solution. h, The orientation tuning selectivity progressively 
decreased during the recording as MK-801 diffused into the cell and blocked 
NMDA channels. In these example traces, the response to the preferred 
orientation decreased from early in the recording to late in the recording. 

i, Across the population, subthreshold orientation tuning was strongly 
inhibited at late time points in the recording compared with early time points. 
j, In control recordings, with no blockers in the pipette solution, the orientation 
tuning selectivity did not significantly change across the recording and was not 
significantly different from the early period of MK-801 recordings (P > 0.05, 
two-sample t-test). Bars in f-i and j represent population means, and error bars 
indicate s.e.m. Between the bars, the individual data points are shown, and lines 
connect data taken in the same recording. 


to regenerative events in dendrites'®’. Visually driven synaptic input 
to layer 2/3 pyramidal cells activates synaptic NMDA receptors”. To 
determine whether NMDA receptors form one of the voltage-dependent 
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mechanisms that contributes to nonlinear synaptic integration in vivo, 
we used whole-cell somatic recordings with a use-dependent, intracel- 
lular NMDA receptor blocker, MK-801, included in the pipette solu- 
tion”. Intracellular MK-801 did not affect up and down state dynamics 
(Extended Data Fig. 9a, b), confirming that the manipulation was 
restricted to the recorded cell, and dendritic spiking in the recorded 
cell was blocked by MK-801 (Extended Data Fig. 9c, d). Early in these 
recordings, before effective NMDA receptor block, the subthreshold 
orientation tuning was normal; however, late in these recordings, after 
MK-801 had diffused through the dendritic tree, V,,OSI decreased 
markedly (Fig. 5h-j), as did spike-based tuning (OSI early in recording, 
0.82 + 0.12; late in recording, 0.45 + 0.17; P = 0.016, paired t-test; n = 5; 
Fig. 5i, j). Because the orientation selectivity was degraded despite 
the remaining, unblocked, receptor pathways (for example, AMPA 
(o%-amino-3-hydroxy-5-methyl-4-isoxazole propionic acid) receptors 
and metabotropic glutamate receptors), we conclude that NMDA receptor 
currents are crucial for the tuning of synaptic integration in these 
neurons and that their depolarization-dependent relief from Mg”* 
block might be a key mechanism that links synaptic input and dendritic 
regenerative events, including dendritic spikes*””. 

We have presented multiple lines of evidence, including direct 
dendritic recordings, showing that visually evoked sensory input trig- 
gers local dendritic spikes in cortical pyramidal neurons in vivo. 
Moreover, we have shown that hyperpolarization decreases both the 
rate of dendritic spiking and V,,OSI, thus demonstrating that voltage- 
dependent dendritic mechanisms help shape the input-output func- 
tion of neurons during sensory processing”. Finally, we have also 
provided evidence for at least one specific biophysical mechanism 
that could underlie these dendritic events: NMDA receptor current. 
Together, these results show that the synaptic integration of sensory 
input crucially relies on voltage-dependent dendritic mechanisms. 

A detailed compartmental model of layer 2/3 pyramidal cells con- 
firmed that a biophysically plausible model can account for our data 
(Supplementary Note 1, Extended Data Fig. 10 and Supplementary 
Videos 1-3). This model reproduced the key findings of this study and 
offers potential mechanistic insight. For example, passively propagated 
Na” spikes can arrive immediately after an actively generated spike on 
a dendrite and support instantaneous spike rates of several hundred 
Hertz, as observed in distal dendritic recordings (Fig. 2a). In addition, 
cooperative recruitment of NMDA receptor current provides a crucial 
link between synaptic input and fast dendritic spiking. These results 
show that basic biophysical mechanisms support the electrophysio- 
logical phenomena that we observed in direct dendritic recordings in vivo. 

Our experimental results and compartmental modelling suggest 
that synaptic input causes a dendritic depolarization that activates 
voltage-dependent ion channels and relieves the Mg~* block of NMDA 
receptors”’®. This results in a supralinear, local regenerative event 
that includes dendritic Na“ spikes. The slow time course of the NMDA 
receptor current component of the regenerative events causes a pro- 
longed depolarization envelope that propagates to the soma and enhances 
axonal output. Thus, local computational subunits generated by voltage- 
dependent mechanisms in dendrites” are activated by sensory input 
in vivo, provide an orientation-tuned signal to the soma and thereby 
help determine stimulus selectivity. 

Dendritic regenerative events provide a mechanism by which a 
relatively small number of inputs can drive spike output**”*, changing 
the effective connectivity between local functional groups of neurons” 
or mitigating the noise in cortical circuits”’, by ensuring that variable 
synaptic input can result in a more reliable postsynaptic response. Our 
data from experiments on awake mice demonstrate that these dend- 
ritic events occur during alert sensory processing”®. Because dendrite- 
targeting inhibitory interneurons are inhibited in awake mice during 
sensory stimulation”, this circuitry might have a key role in gating 
sensory input*’. Overall, our results demonstrate that dendrites are not 
passive integrators of sensory-driven input in vivo. Rather, sensory 
input engages dendritic voltage-dependent mechanisms and thereby 
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generates local regenerative events and dendritic spikes, which have an 
important role in shaping orientation selectivity, a quintessential cor- 
tical computation. 


METHODS SUMMARY 


Wild-type C57BL/6 mice (24-56 days old, both male and female) were anaesthe- 
tized with isoflurane (5% for induction, 1.5-2.5% for surgery and 0-0.5% during 
recording), augmented with chlorprothixene (0.5-2 mgkg '). Experiments with 
awake mice were carried out with head fixation, during which mice could freely 
move their limbs, groom and drink a sweet liquid reward when offered. Square- 
wave gratings (0.04 cycles per degree, 2 cycles s_') were displayed ona liquid crystal 
display (LCD) screen to map orientation selectivity. The screen was shrouded with 
acone up to the eye of the mouse to prevent contamination of the imaging pathway 
with light from the visual stimulus. For both dendritic and somatic recordings, the 
pipette solution contained 135 mM or 140 mM K* methylsulphate, 4mM or 10mM 
KCI, 10 mM HEPES, 10 mM Na)-phosphocreatine, 4mM Mg-ATP, 0.3 mM Nap- 
GTP, 0.1 mM Oregon Green 488 BAPTA-1 and 0.025-0.050 mM Alexa Fluor 594; 
pH adjusted with KOH to 7.2; 290mmolkg '. Pipette resistances ranged from 
4.9 MQ to 11 MQ (mean, 7.9 MQ) for dendritic recordings and from 5 MQ to 
8 MQ (mean, 6.9 MQ) for somatic recordings. All recordings were from the apical 
dendrites of superficial layer 2/3 pyramidal neurons. Unlike layer 5 neurons or 
deeper layer 2/3 neurons, these neurons do not have a prominent apical trunk and 
tuft; therefore, the recorded dendrites were geometrically similar to basal den- 
drites. A custom-built two-photon microscope with a 16 magnification and 
0.8 numerical aperture water immersion objective (Nikon) and a large aperture 
collection pathway was used to visualize neurons. In dendritic recordings, putative 
bAPs were automatically identified as single spikes when isolated from other spikes 
by at least 50 ms. Unless otherwise specified, all measurements are expressed as 
mean + s.e.m. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Preparation. All experiments were carried out in accordance with the regulations 
of the UK Home Office or the guidelines and regulations of the US Department of 
Health and Human Services and the University of North Carolina. Wild-type 
C57BL/6 mice (24-56 days old, male and female) were anaesthetized with isoflurane 
(5% for induction, 1.5-2.5% for surgery and 0-0.5% during recording), augmented 
with chlorprothixene (0.5-2 mgkg '). During recording, this regime resulted in a 
sedated state in which the animals did not voluntarily move but responded to a toe 
pinch. Increased levels of isoflurane (>0.5%) typically suppressed spiking activity, 
including dendritic bursts. Experiments with awake mice were carried out with 
head fixation, during which mice could routinely move their limbs, groom and 
drink a sweet liquid reward when offered. 

After gluing a head plate to the skull for head fixation during surgery and 
recording, a 2-3-mm diameter craniotomy was performed over the monocular 
visual cortex. A thin layer of agar (1.5%) was dissolved in artificial cerebrospinal 
fluid (150 mM NaCl, 2.5 mM KCl, 10 mM HEPES, 2 mM CaCl, and 1 mM MgCl; 
pH adjusted with NaOH to 7.3; 300 mmol kg” ') and placed on top of the brain to 
help dampen movement. A homeothermic heat pad was used to maintain the body 
temperature within the physiological range, and a water-based ophthalmic oint- 
ment was used to maintain eye health. 

Visual stimulation. Visual stimulus presentation was controlled by routines writ- 
ten in MATLAB (MathWorks) using the Psychophysics Toolbox extensions*'”. 
Square-wave gratings (0.04 cycles per degree, 2 cycless_') of black (2cdm~*) and 
white (86 cdm ”) bars in eight different orientations were displayed on a liquid 
crystal display (LCD) screen (ESAW 7-inch VGA and composite TFT monitor, set 
at 1024 X 768 resolution and a 60 Hz refresh rate) to map orientation selectivity. 
The screen was shrouded with a cone up to the eye of the mouse to prevent 
contamination of the imaging pathway with light from the visual stimulus. The 
visual stimulus extended from +20° to +124” in azimuth and from — 10° to +42° 
in elevation. Visual stimuli were presented in a shuffled order: each orientation was 
presented once per sweep of eight orientations, and the order of the orientations 
was different in each sweep. 

Patch-clamp recordings. For both dendritic and somatic recordings, the pipette 
solution contained 135 mM or 140 mM K* methylsulphate, 4 mM or 10 mM KCl, 
10 mM HEPES, 10 mM Na3-phosphocreatine, 4 mM Mg-ATP, 0.3 mM Na)-GTP, 
0.1 mM Oregon Green 488 BAPTA-1 and 0.025-0.050 mM Alexa Fluor 594; pH 
adjusted with KOH to 7.2; 290 mmol kg” ’. Pipette resistances ranged from 4.9 MQ 
to 11 MQ (mean, 7.9 MQ) for dendritic recordings and from 5 MQ to 8 MQ (mean, 
6.9 MQ) for somatic recordings. For dendritic recordings, imaging’’ was used to 
guide the pipette away from blood vessels and somata, and tip resistance measured 
using a voltage step in voltage-clamp mode was used to detect contact with a 
dendrite. Shadowpatching techniques*’ were used to directly target the pipette 
to the soma. In somatic hyperpolarization experiments, cells were hyperpolarized 
by 214+ 114mV (mean = s.d.; from a membrane potential of —49.4 + 6.7 mV 
to —70.8 +9.4mV, mean + s.d.; average Vy across all stimuli after removing 
spikes). All dendritic recordings were from the apical dendrites of superficial layer 
2/3 pyramidal neurons. These neurons do not have a large apical trunk and tuft, 
unlike layer 5 or deeper layer 2/3 neurons ; therefore, the recorded dendrites were 
geometrically similar to basal dendrites. The series resistance was 39 + 5 MQ and 
34 + 5 MQ for dendritic and somatic recordings, respectively. The series resistance 
did not change as a function of dendritic distance from the soma (Pearson’s 
R= —0.24, linear regression, P = 0.50; n = 13). The bridge was rebalanced as 
needed during the recording. Reerics increased less than 2% over the time it took 
to run a tuning curve. In one case, we obtained two distal dendritic recordings in 
the same mouse. In all other cases, we obtained at most one distal dendritic 
recording per mouse. 

Compartmental modelling. Simulations were performed with the NEURON 
simulation environment (version 7.2) using a detailed reconstruction of a biocytin- 
filled layer 2 pyramidal neuron (NeuroMorpho.org ID Martin, NMO_00904). 
Passive parameters were C,, = 1 uF cm’, Ry = 7,0002 cm’, R, = 100 Q cm, yield- 
ing a somatic input resistance of 110 MQ, similar to the experimentally measured 
mean value (Extended Data Fig. 1a). AMPA, NMDA and GABA, (y-aminobutyric 
acid A) synapses had a peak conductance of 1 nS and were modelled as a bi-exponential 
function, with time constants of AMPA tl = 0.1 ms, AMPA 12 = 1 ms, NMDA 
tl =2ms, NMDA 12 =20ms, GABA,tl = 0.1ms and GABA, t = 4 ms (the 
inhibition reversal potential was set to -80mV). The Mg** block of NMDA 
synapses was modelled according to Jahr and Stevens**. Active conductances were 
introduced in the soma and in all dendritic compartments and included the follow- 
ing: voltage-activated Na‘ channels (soma 100 mS cm 7 and dendrite 60 mS cm *, 
distributed as a hot spot in the centre of each branch)**; high-voltage activated Ca** 
channels (soma 0.05 mS cm * and dendrite 0.05 mS cm *); low-voltage activated 
Ca** channels (soma 0.3 X 10°? mS cm? and dendrite 0.15 X 10°? mScm ”); 
voltage-activated K* channels (10mScm” soma and 0.3mScm * dendrite); 
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M-type K* channels (soma 0.22 mS cm? and dendrite 0.1 mS cm”); and Ca?*- 
activated K* channels (soma 0.3 mScm 7” and dendrite 0.3 mS cm ”). The time 
course of dendritic Ca** was modelled by adding the current carried by voltage- 
activated Ca** channels and the calcium fraction of the NMDA current (10%)*° 
and convolving it with a decaying exponential with a 50 ms time constant. AMPA and 
NMDA components were colocalized and coactivated at each excitatory synapse. 
A total of 1,100 synapses were randomly distributed across the dendritic tree: 
80% excitatory and 20% inhibitory. The synapses were separated into background 
and signal synapses (the signal synapses constituted 10% of the total synapse 
number) and were activated with independent Poisson spike trains delivered to 
each synapse. The background synapses were continuously activated at a mean 
rate of 0.5 Hz, and the signal synaptic input was activated at 5-8 Hz after a 200 ms 
baseline and stayed active for 200 ms. The results did not depend on the total 
number of synapses but rather the total input rate. For example, 100 synapses 
activated at 1 Hz yielded the same result as 1,000 synapses activated at 0.1 Hz (the 
model did not include short-term plasticity). 
Imaging. A custom-built two-photon microscope using galvanometer-based scan 
mirrors (6-mm diameter, Cambridge Technologies) with a 16% magnification, 0.8 
numerical aperture water immersion objective (Nikon) and a large aperture col- 
lection pathway with low-noise photomultiplier tubes (models 3896 and 7422-40P, 
Hamamatsu) was used to image neurons. Frame scans (15.6 frames s ') and line 
scans (1 ms per line) were acquired using ScanImage”’. 
Analysis. Custom programs written for MATLAB and IGOR Pro (WaveMetrics), 
including event detection and analysis routines in IGOR Pro by T. Ishikawa (Jikei 
University), were used for analysis. Input resistance was measured as the steady- 
state membrane potential in response to a current step. Maximum instantaneous 
firing rates were computed as the reciprocal of the smallest interspike interval. In 
dendritic recordings, putative bAPs were automatically identified as single spikes 
when isolated from other spikes by at least 50 ms. 
Statistics. Unless otherwise specified, all measurements are expressed as mean + 
s.e.m. The Shapiro-Wilk test was used, before the application of two-sided para- 
metric tests, to confirm that the deviation of the data from normality was statistically 
insignificant. For non-normal data, or data with significantly different variances, 
non-parametric tests were used as noted in the text (for example, the Wilcoxon 
rank sum test used in Fig. 2). The multivariate Kolmogorov-Smirnov test was 
computed using the generalization of Fasano and Franceschini’*. Experimental 
manipulations (hyperpolarization and MK-801 block of NMDA receptors) did not 
involve randomization or blinding because each recorded cell served as its own 
control (control versus hyperpolarized or early versus late recording, respectively). 
Sample sizes were designed to reliably measure neurophysiological parameters 
while remaining in compliance with ethical guidelines to minimize the number 
of animals used. The results from Fig. 2a remain statistically significant (P < 0.05) 
when the top one or two data points from the distal dendritic recordings are 
omitted from the analysis. To test the statistical significance of the orientation 
tuning of dendritic bursts, we first computed the OSI of the measured dendritic 
burst firing. We then shuffled the responses so that their associated orientation was 
assigned at random. We then computed the OSI of the new, reshuffled, responses. 
This procedure was repeated 1,000 times. The P value was taken as the fraction of 
time for which a randomly shuffled response yielded an OSI equal to or greater 
than the actual response. 
Orientation tuning analysis. The OSI was computed after fitting a sum of two 
Gaussian curves to the orientation tuning curve (whether based on spiking or 
membrane potential), with the centres 180° apart: 


where R, is the response to the preferred orientation and R, is the response at the 
orthogonal orientation. Cells in which the sum of squared residuals between the 
Gaussian fit and the observed data was less than 15% of the maximum value of 
the Gaussian fit were considered to be well described by the Gaussian fit. 

Membrane potential (V,,,) modulation in response to visual stimuli was com- 
puted by taking the peak-to-trough amplitude of the cycle average at 2 Hz (the drift 
rate of the visual stimulus gratings) (after removing any spikes). This modulation 
can be seen in the traces in Fig. 5c, particularly at 135° and 315°. 
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Extended Data Figure 1 | Electrophysiological features of layer 2/3 
dendrites in vivo. a, The input resistance of distal dendrites was typically 100- 
300 MQ but sometimes larger (up to 600 MQ). The input resistance increased 
as function of dendritic distance from the soma, approximately doubling every 
300 um. The grey point indicates the input resistance measured in somatic 
patch-clamp recordings (mean + s.e.m.). b, During a dendritic recording at 
150 um from the soma, hyperpolarizing current steps did not reveal a voltage 
sag; thus, there is probably little to no hyperpolarization-activated cation 
current, J), in the dendrites of layer 2/3 pyramidal neurons in vivo. c, The peak 
voltage response plotted against the hyperpolarizing current step amplitude in 


200 ms 


an I-V plot was well fit by a linear function, confirming the lack of I. 

d, Representative dendritic bursts evoked by visual stimulation at the optimal 
orientation in nine different dendritic recordings at progressively increasing 
distances from the soma. All right-hand scale bars are 20 mV. e, f, Compared 
with action potentials recorded at the soma, bAPs had a lower amplitude 

(e) and were prolonged in time (f), and both of these trends were more 
pronounced with increasing dendritic distance from the soma (error bars, s.d.). 
Both the amplitude and width were significantly different among the three 
groups (P < 0.01, unpaired t-tests with the Bonferroni correction for multiple 
comparisons). 
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Dendritic bursts 
bAPs 


Cell 1 Cell 2 Cell 3 Cell 4 
Max. burst rate (s") 0.75 3.4 0.88 0.88 
Max. bAP rate (s"): 1.0 11 26 65 
Difference: oy" 5.8° 2a" 83.2° 
(in preferreed 
orientation) 


Extended Data Figure 2 | Orientation tuning curves of dendritic bursts 
compared with bAPs. Tuning curves for dendritic spike bursts and bAPs 
recorded at distal dendritic locations (>75 tm from soma) are shown. The 
tuning curves for dendritic bursts match the tuning curves for isolated bAPs. 
The statistical significance of dendritic burst tuning curves was tested by 
randomly shuffling responses (details in the Methods) and was found to be 
significant (P < 0.05) for 7 out of 9 cells (dendritic burst tuning in cells 6 and 9 
was not significant). The curves were normalized to the maximal values, which 
are shown at the bottom right of each polar plot. The small qualitative 
differences may be due to dendrites that are topologically distant from the 


Cell 5 Cell 6 Cell 7 Cell 8 Cell 9 
28 1.4 Aad 0.75 1.4 
23.4° 55.5° Ae 0.16° 0.13° 


dendritic recording site exhibiting slightly different tuning curves. The grating 
drift direction that elicited the largest response is indicated with an arrow. The 
difference between these directions is indicated at the bottom of each polar plot. 
The cross correlation between dendritic bursts and isolated bAPs was highly 

significant: Pearson’s R = 0.54, P = 0.000013, paired t-test; n = 9. When only 
the spikes in the bursts with rise times in the slowest quartile of the distribution 
were considered to be dendritic in origin, the preferred orientation of bAPs and 
the slowest quartile were still matched within individual dendritic recordings 
(difference in preferred orientation, 41.5 + 58.1°; P = 0.49, paired t-test; n = 9). 
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Extended Data Figure 3 | Dendritic recordings in awake mice exhibit 
dendritic bursts. a, Awake, head-fixed mice viewed drifting gratings during 
electrophysiological recordings. b, A two-photon image of the patched dendrite 
(117 jum from the soma) of a layer 2/3 pyramidal neuron in the mouse visual 
cortex, after filling with Alexa Fluor 594 with the dendritic patch-clamp pipette. 
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c, Dendritic bursts were observed when the preferred orientation was 
presented. d, Tuning curves for the isolated bAPs and dendritic bursts. 

e, Example bursts from three different distal dendritic recordings in awake 
mice. Calibration bars, 25 mV. 
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Extended Data Figure 4 | The diversity of onset dynamics versus membrane 
potential. a, Spikes from each distal dendritic recording (both isolated bAPs 
(black) and spikes in dendritic burst events (red)) were normalized such that 
isolated bAPs had a mean phase slope of 1. The mean baseline membrane 
potential (Vn) of isolated bAPs was subtracted from the mean baseline V,, of all 
spikes. Although many spikes in bursts had a depolarized baseline V,,, relative 
to isolated spikes, there was overlap between the two populations around 

+3 mV. b, Magnification of panel a to show spikes at +3 mV relative to the 


mean baseline V,, of isolated bAPs. c, Histograms of the two populations reveal 
a tendency towards lower phase slope values for spikes in bursts (P = 0.041, 
Kolmogorov-Smirnov test ; 1 = 211 bAPs, 80 spikes in bursts). d, An example 
of bAPs and a spike in a burst (both from the same distal dendritic recording): 
although the bAP has a more depolarized baseline V,,,, it still exhibits a steeper 
phase slope (a kink at the foot of the voltage waveform), indicative of a 
propagated action potential. 
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Extended Data Figure 5 | Ca’* imaging at the site of dendritic recording 
reveals that global Ca’* signals are associated with faster onset spikes. 

a, During dendritic recordings, Ca~” signals were simultaneously imaged at the 
site of the recording and at nearby dendrites. b, In dendritic bursts with global 
Ca’~ signals that were simultaneously observed in all regions of interest (ROIs), 
the spikes recorded at the dendrite exhibited steep onsets, indicating that they 
were probably bAPs. ¢, In local Ca** signals that were observed only in the ROI 


at the site of recording, the dendritic spikes exhibited slower onsets, indicating 
that they were probably locally generated. d, The maximum phase slope of 
spikes occurring during global Ca** events was higher than for spikes 
occurring during local Ca’* events (P = 0.0069, t-test). e, f, When global Gae* 
signals occurred during ongoing local Ca*™ signals, the initiation was 
associated with a steep onset spike. Two examples are shown. 
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tuning. a, Raw data for an example cell in which subthreshold orientation potential was quite sharp and was confined to two directions. 
tuning was observed, although no spikes were fired during stimulus 
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Extended Data Figure 7 | Tuning of action potentials and subthreshold preferred orientation of the subthreshold response during hyperpolarization. 


membrane potential. a, In individual cells, the orientation tuning ofspikesand __d, The black curve is the fitted subthreshold orientation tuning curve (the black 
the membrane potential were highly correlated, indicating that thetuning ofthe _ circles are raw data points), and the red curve is the subthreshold tuning curve 
subthreshold responses was not spurious (mean difference in preferred during hyperpolarization (the red circles are raw data points). The V,,OSI 
orientation, 14.8 + 5.3°). b, In individual cells, the orientation selectivity index values for the control and hyperpolarized conditions are shown next to each 
based on the membrane potential response (V,,OSI) was highly correlated with _ plot. The radial axes are linear and start at 0. The maximal radial axis range is 
the conventional spiking-based OSI. ¢, In individual cells, the preferred shown below each polar plot. The differences in V,,OSI are quantified in Fig. 5g. 
orientation of the control subthreshold response was correlated with the 
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Extended Data Figure 8 | Changes in the driving force for Cl” do not 
account for the effects of hyperpolarization on V,,OSI. a, When the pipette 
solution contained 10 mM Cl , the reversal potential for chloride, Ec), was 
estimated to be —71 mV (based on the assumption that natural cerebrospinal 
fluid contains a similar amount of Cl to the artificial cerebrospinal fluid used). 
In this situation, hyperpolarization decreased the orientation selectivity index. 
b, Even with a low Cl” concentration (4 mM; estimated Eg, —95 mV), the 
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result was the same. c, There was no significant correlation between the driving 
force for Cl and V,,,OSI. d, The data were resampled (15 of the data points in 
cwere selected at random, and the R and P values for that set of data points were 
calculated; this process was repeated 10,000 times), and this process confirmed 
that the result from the correlational analysis in ¢ was not biased by a small 
subset of the data points (the mean R and P values from the resampling analysis 
match the values for the full data set in c well). 
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Extended Data Figure 9 | The effect of intracellular MK-801 on up and 
down states and dendritic spikes. a, To determine whether MK-801 that 
might have leaked out of the pipette during patching affects network circuitry, 
we examined the dynamics of up and down states in recordings in which MK- 
801 was in the pipette and in control recordings (no MK-801). b, Although, in 
general, the membrane potential drifted up slightly (<5 mV on average) and 
the time spent in the up state increased over long recordings (possibly due to the 
anaesthesia wearing off), these trends were identical with or without MK-801 in 
the patch pipette. c, When 1 tM MK-801 was included in the recording pipette, 
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the visually evoked responses contained fewer bAPs and bursts. This trend was 
clear in individual cells (a) and across the population (d). This reduction in 
spiking confirms that dendritic bursts do not occur when NMDA receptors are 
blocked. Because the low firing rate in MK-801 recordings prevented a reliable 
measurement of orientation tuning in the MK-801 dendritic patch-clamp 
recordings, we averaged over all of the stimulus presentations for both 
conditions, resulting in a lower average firing rate for bAPs and dendritic 


bursts. 
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Extended Data Figure 10 | Compartmental modelling of dendritic events. 
a, A detailed reconstruction of a layer 2/3 pyramidal cell was used in the 
simulations. Light green circles over the dendritic tree represent background 
synapses, and dark green circles represent signal synapses (the model had 1,100 
synapses; not all are illustrated). Voltage was recorded at the soma and at all 
dendritic branches simultaneously. b, Activation of signal synapses at 5 Hz 
produced high-frequency dendritic bursts, composed of local dendritic spikes 
and bAPs. These bursts were always accompanied by dendritic Ca** transients. 
The timing of the activation of excitatory synapses on the recorded dendritic 
branch is illustrated. Note how the local excitatory postsynaptic potentials 
(EPSPs) are clearly smaller than the dendritic spikes. c, Examples of specific 
features consistently observed in the model. Isolated bAPs were associated with 
global Ca** transients and had kinked onsets. Dendritic spikes often preceded 
somatic action potentials, had smooth onsets and Ca’ transients that were 
localized to the branches where the spikes were recorded, and clearly started 
before the global transients associated with bAPs. Local dendritic spikes 
initiated in the dendrite could often be recorded in multiple electrotonically 
close dendritic branches. Pairs of local spikes and bAPs reached very high 
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frequencies; the example shows a pair at >400 Hz. When NMDA receptors 
were removed from the simulations, no dendritic spikes were observed, and the 
soma failed to reach the threshold for action potential firing. This also occurred 
when there were no dendritic voltage-activated Na* channels, indicating that 
the generation of dendritic spikes is required for producing axonal output. 

d, Quantification of spike onset for local dendritic spikes and bAPs in the model 
reproduced the experimentally observed effect reported in Extended Data 
Fig. 5d. e, Example trial showing the somatic voltage and recordings for two 
dendrites indicated in a. For each dendrite, the local voltage, the Na* channel 
conductance (gNa, expressed as a fraction of the maximum conductance) and 
the timing of activation of excitatory synapses on the recorded dendrite are 
shown. The gNa traces show that there is significant local Na* channel 
inactivation after the first spike and that subsequent spikes are associated with 
varying degrees of Na* channel conductance. Asterisks denote extreme cases 
when a bAP followed a local dendritic spike at very high frequency and did not 
recruit any local gNa, thereby indicating that the propagation into the recorded 
branch was passive. 
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The activity-dependent transcription factor NPAS4 
regulates domain-specific inhibition 


Brenda L. Bloodgood!*, Nikhil Sharma’**, Heidi Adlman Browne!+, Alissa Z. Trepman’+ & Michael E. Greenberg! 


A heterogeneous population of inhibitory neurons controls the 
flow of information through a neural circuit’ *. Inhibitory synapses 
that form on pyramidal neuron dendrites modulate the summation 
of excitatory synaptic potentials*® and prevent the generation of 
dendritic calcium spikes”*. Precisely timed somatic inhibition lim- 
its both the number of action potentials and the time window dur- 
ing which firing can occur*’. The activity-dependent transcription 
factor NPAS4 regulates inhibitory synapse number and function in 
cell culture”, but how this transcription factor affects the inhibitory 
inputs that form on distinct domains ofa neuron in vivo was unclear. 
Here we show that in the mouse hippocampus behaviourally driven 
expression of NPAS4 coordinates the redistribution of inhibitory 
synapses made onto a CA1 pyramidal neuron, simultaneously increas- 
ing inhibitory synapse number on the cell body while decreasing the 
number of inhibitory synapses on the apical dendrites. This rearran- 
gement of inhibition is mediated in part by the NPAS4 target gene 
brain derived neurotrophic factor (Bdnf), which specifically regu- 
lates somatic, and not dendritic, inhibition. These findings indicate 
that sensory stimuli, by inducing NPAS4 and its target genes, dif- 
ferentially control spatial features of neuronal inhibition in a way 
that restricts the output of the neuron while creating a dendritic 
environment that is permissive for plasticity. 

We assessed NPAS4 expression in the hippocampus by immuno- 
fluorescence microscopy. We found that in hippocampi from mice 
maintained in standard housing, NPAS4 protein is undetectable by 
immunostaining (Fig. la). By contrast, exposure of mice to an enriched 
environment induces NPAS4 protein expression in neurons in the 
pyramidal layer, whereas no NPAS4-positive nuclei are observed in 
the neuropil (Fig. 1b). To obtain more widespread induction of NPAS4, 
mice were injected intraperitoneally with kainic acid to trigger seizure 
activity. This treatment led to NPAS4 expression in nearly all excitatory 
neurons in the pyramidal layer and a minority of inhibitory neurons in 
the neuropil (Fig. 1c). 

We next asked whether the loss of NPAS4 affects inhibitory synapse 
function in vivo. Adeno-associated virus encoding a Cre-GFP fusion 
protein (AAV-Cre-GFP)'! was injected into the CA1 region of the 
hippocampus in Npas4 conditional mice (Npas4'") (Fig. 1d, Extended 
Data Fig. 1). Post-operative animals were transferred to an enriched 
environment or maintained in standard housing for 7 to 12 days. 
Subsequently acute hippocampal slices were prepared, whole-cell volt- 
age clamp recordings were made from neighbouring NPAS4 knockout 
and wild-type neurons (referred to as NPAS4-KO and NPAS4-WT, 
respectively) and pharmacologically isolated miniature inhibitory 
postsynaptic currents (mIPSCs) were recorded (Fig. le). 

We found that when mice were exposed to an enriched environment 
significantly less frequent and slightly smaller amplitude mIPSCs were 
recorded from NPAS4-KO neurons compared to the neighbouring 
NPAS4-WT neurons (Fig. 1g, Supplementary Table 1). Similar results 
were obtained from mice injected with kainic acid to induce NPAS4 


(Fig. 1h). By contrast, no significant difference in mIPSC frequency 
or amplitude was observed between neighbouring NPAS4-WT and 
NPAS4-KO neurons in slices acquired from mice maintained in stand- 
ard housing (Fig. 1f). Next we measured mIPSCs in wild-type mice and 
detected no differences between AAV-Cre-GFP infected and uninfec- 
ted neurons (Extended Data Fig. 2a, b) indicating that the observed 
changes in mIPSCs in Npas4"' * neurons are due to Npas4 excision. 
Finally, we crossed EMX-Cre and Npas4" mice to generate animals 
that lack NPAS4 postnatally and specifically in excitatory neurons 
(Extended Data Fig. 2c, d). Both mIPSC frequency and amplitude were 
significantly reduced in NPAS4-KO neurons from animals injected 
with kainic acid compared to mice maintained in standard housing 
(Extended Data Fig. 2e). From these analyses we conclude that disrup- 
tion of behaviourally induced NPAS4 expression leads to a decrease in 
mIPSC frequency recorded from CA1 pyramidal neurons. 

This decrease in mIPSC frequency could bea consequence of homeo- 
static regulation of all inhibitory synapses or the modification of inhib- 
itory synapses that form on discrete regions of the pyramidal neuron. 
Local inhibitory neuron subtypes have axons that elaborate preferen- 
tially in one or more layers of the hippocampus”’. Thus, stimulation of 
axons in the different laminar domains of the hippocampus and ana- 
lysis of the resulting inhibitory currents has the potential to reveal 
whether the NPAS4-dependent effects on inhibition are due to cell- 
wide or domain-specific regulation of inhibitory synapses. 

We sparsely excised Npas4 and made direct comparisons of layer- 
specific, pharmacologically isolated, monosynaptic evoked inhibitory 
postsynaptic currents (eIPSCs) recorded simultaneously from neigh- 
bouring NPAS4-KO and NPAS4-WT CAI neurons (Fig. 2a). As pre- 
dicted by cable properties, eIPSC rise times correlated with distance 
of the stimulating electrode from the recording electrode (Extended 
Data Fig. 3, Supplementary Table 2) indicating that stimulation of 
axons within a given layer is predominately activating synapses within 
that layer. 

We found that in slices obtained from mice housed in a standard 
environment, equivalent eIPSCs were recorded from NPAS4-WT and 
NPAS4-KO neurons in response to stimulation of axons in any of the 
hippocampal layers (Fig. 2b-e, Supplementary Table 2). However, in 
slices obtained from mice exposed to an enriched environment, stimu- 
lation of axons in the pyramidal layer generated smaller elPSCs in 
NPAS4-KO neurons than in the neighbouring NPAS4-WT neurons 
(Fig. 2h). By contrast, stimulation of axons in stratum radiatum generated 
significantly larger eIPSCs in NPAS4-KO neurons than in neighbour- 
ing NPAS4-WT neurons (Fig. 2g), whereas no difference was detected 
between NPAS4-WT and NPAS4-KO neurons upon stimulation of 
axons in oriens or lacunosum (Fig. 2f, i). Similar results were obtained 
from mice injected with kainic acid to induce NPAS4 (Extended Data 
Fig. 4). These findings indicate that NPAS4 expression does not result 
in uniform, homeostatic regulation of inhibitory synapses. Rather, NPAS4 
functions in an unprecedented manner, simultaneously decreasing 
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Figure 1 | Exposure of mice to 
increased circuit activity reveals an 
NPAS4-dependent regulation of 
inhibition in vivo. 
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inhibition within the proximal apical dendrites while increasing inhibi- 
tion at the soma. 

Extracellular stimulation of axons could result in the depolarization 
of axons of passage that preferentially form synapses on NPAS4-WT 
or NPAS4-KO neurons and in regions other than the intended layer. 
To address this possibility we compared the rise times of the eIPSCs 
recorded from NPAS4-WT and NPAS4-KO neurons and found them 
to be indistinguishable (Extended Data Fig. 3). This indicates that the 
redistribution of inhibition observed is not due to a bias in synaptic 
transmission associated with axons of passage onto NPAS4-WT or 
NPAS4-KO neurons. Additionally, the hippocampus contains several 
interneuron subtypes that form synapses within multiple layers (that 
is, bistratified and trilaminar cells). If NPAS4 regulates synapses made 
by these cell types, it is probable that only boutons that synapse on the 
apical, and not basal, dendrites are affected by loss of NPAS4. 

We next investigated the cellular mechanisms by which NPAS4 dif- 
ferentially regulates inhibition, including possible effects on dendritic 
growth, changes in the probability of inhibitory synaptic vesicle release 
or changes in inhibitory synapse number. We first asked whether loss 
of NPAS4 results in altered dendritic complexity which could indirectly 
lead to larger eIPSCs in response to stimulation of axons in radiatum. 
However, this seems unlikely because Sholl analysis revealed that the 
disruption of NPAS4 function has no effect on dendrite number or 
length (Fig. 3a, b, Extended Data Fig. 5a, b). In addition, the capacitance 
of neurons from mice housed under standard conditions, in an enriched 
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environment or exposed to kainic acid are equivalent (Supplementary 
Table 3). 

We next asked whether disruption of Npas4 leads to changes in the 
presynaptic release of neurotransmitter at axons that synapse onto 
different regions of the pyramidal neuron. We measured the paired 
pulse ratios (PPRs) upon stimulation of axons in each layer, and com- 
pared NPAS4-WT and NPAS4-KO neurons in slices obtained from 
mice housed under standard conditions, in an enriched environment, 
or injected with kainic acid. We found no difference between the PPRs 
of NPAS4-WT and NPAS4-KO neurons under any of these conditions 
(Fig. 3c-e, Extended Data Fig. 5c—-e, Supplementary Table 2) indicating 
that the NPAS4-dependent changes in eIPSCs are not likely due to 
changes in the probability of presynaptic vesicle release. 

Finally we asked whether NPAS4 differentially regulates the number 
of inhibitory synapses on the soma or apical dendrites. To quantify 
inhibitory synapses we co-injected AAVs encoding mCherry-IRES- 
Cre and Cre-dependent GFP-gephyrin fusion protein into the CA1 
region of the hippocampus in wild-type and Npas4‘" mice. GEP- 
gephyrin fusion proteins when expressed at low levels localize to inhib- 
itory synapses without significantly altering neurotransmission'*”’. 
Cre was sparsely expressed and GFP-gephyrin puncta quantified on 
NPAS4-WT and NPAS4-KO neurons. This analysis revealed that when 
mice were kept in standard housing, NPAS4-WT and NPAS4-KO 
hippocampal pyramidal neurons have equivalent densities of GFP- 
gephyrin puncta on the soma and apical dendrites. However, after 
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Figure 2 | Behaviourally induced 
NPAS4 differentially regulates 


inhibitory synapse function across 
the somato-dendritic axis of 
pyramidal neurons. a, Experimental 
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exposure to an enriched environment, NPAS4-KO pyramidal neurons 
have significantly fewer somatic and more dendritic GFP-gephyrin 
puncta than NPAS4-WT neurons (Fig. 3f-h) consistent with the elec- 
trophysiology data described above. Strikingly, comparison of the 
number of GFP-gephyrin puncta measured on wild-type neurons 
revealed that exposure to an enriched environment leads to an increase 
in the number of somatic and a decrease in the number of dendritic 
inhibitory synapses. In both domains of the neuron, the behaviourally 
triggered changes in inhibition are NPAS4 dependent. 

NPAS4 is a transcription factor and thus to understand how it 
mediates sensory-dependent changes in inhibitory synapse number 
we sought to identify NPAS4 target genes and assess their function. 
We focused our attention on putative NPAS4 targets that satisfy three 
criteria: (1) by deep sequencing NPAS4-bound DNA (ChIP-seq), 
NPAS4 is shown to be bound within 10 kilobases of the putative target 
gene; (2) by RNA-seq, the mRNA transcribed near the NPAS4 peak is 
induced greater than fivefold (Extended Data Table 1)'*; (3) upon 
short hairpin RNA (shRNA)-mediated knockdown of the putative 
target gene there is a significant change in mIPSC frequency or ampli- 
tude. Through this screen we identified 16 putative NPAS4 targets that 
display a change in mIPSC frequency or amplitude upon knockdown 
with gene-specific shRNAs in organotypic hippocampal slices, five of 
which we further validated (Extended Data Figs 6-8). 

Of the putative NPAS4 target genes identified by the screen, Bdnf 
seemed a possible candidate to control the number of inhibitory synapses 
that form on a specific region of an excitatory neuron in response to 
sensory stimulation. BDNF expression is regulated by neuronal activity 
in vivo'>-'* and it regulates inhibitory synapse number and function’. 
In addition, Bdnf mRNAs can be targeted to discrete regions of the 
neuron”, and BDNF-containing vesicles are known to be secreted from 
the axon, soma or dendrites of pyramidal neurons**. Furthermore, 
we confirmed by ChIP and quantitative PCR with reverse transcription 
(qRT-PCR) that NPAS4 binds to three sites within the Bdnf gene 
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(Fig. 4a, b), and that induction of the transcription of multiple Bdnf 
isoforms is significantly reduced in the hippocampus of Npas4 knock- 
out (Npas4/ ~) compared to wild-type (C57BL/6J) mice (Fig. 4c). 

To determine if BDNF is a target of NPAS4 that selectively regulates 
somatic and/or dendritic inhibition of pyramidal neurons in vivo, we 
injected AAV-Cre-GFP into BDNF conditional knockout mice (Bdnf uy 
mice and compared eIPSCs between neighbouring Bdnf knockout and 
Bdnf wild-type neurons (BDNF-KO and BDNF-WT, respectively) in 
response to stimulation of axons in the somatic or dendritic layers. In 
mice housed under standard conditions, we found no differences in 
eIPSC amplitude measured from neighbouring BDNF-KO and BDNF- 
WT neurons upon stimulation of axons in any of the layers of the 
hippocampus (Fig. 4d-f, Supplementary Table 2). 

However, when we exposed mice to an enriched environment to 
induce NPAS4 as well as BDNF expression and secretion, stimulation 
of axons in the pyramidal layer results in a significantly smaller ampli- 
tude eIPSC in BDNF-KO neurons relative to neighbouring BDNF-WT 
neurons (Fig. 4h). The disruption of Bdnf function did not have a 
significant effect on eI[PSC amplitude when axons in radiatum or oriens 
were stimulated (Fig. 4g, i). Similar results were observed when BDNF 
expression was induced with kainic acid (Extended Data Fig. 9). Taken 
together these findings indicate that the differential effects of NPAS4 
on pyramidal neuron inhibition are mediated, at least in part, through 
the induction of BDNF that functions selectively and locally to pro- 
mote inhibition at the soma. Given that the disruption of BDNF func- 
tion does not affect e[PSCs in radiatum, the ability of NPAS4 to relieve 
inhibition on the apical dendrites of CA1 pyramidal neurons is prob- 
ably mediated by NPAS4 targets other than BDNF. 

These findings indicate that when a mouse engages with its envir- 
onment, the neuronal activity-regulated transcription factor NPAS4 is 
expressed in pyramidal neurons of the hippocampus where it pro- 
motes an increase in the number of inhibitory synapses on the cell 
soma and a decrease in the number of inhibitory synapses on the apical 
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Figure 3 | Behaviourally induced NPAS4 regulates inhibitory synapse 
number. a, Standard housing example NPAS4-WT and NPAS4-KO CA1 
pyramidal neurons (left) and quantification of NPAS4-WT (black, n = 10 neurons) 
and NPAS4-KO (red, m = 10 neurons) dendrites (right). b, Enriched environment 
Sholl analysis, as in a. NPAS4-WTT, n = 7 neurons; NPAS4-KO, n = 10 neurons. 
c-e, PPRs measured from NPAS4-WT and NPAS4-KO neurons in response to 
stimulation in radiatum (c), pyramidale (d) or oriens (e). Standard housing (black, 
R, n= 9 neurons; P, n = 10 neurons; O, m = 8 neurons) or enriched environment 
(green, R, n = 16 neurons; P, n = 11 neurons; O, n = 16 neurons). Examples are 
from standard housing (top) and are shown normalized to peak current recorded 
from the NPAS4-WT neuron. Scale bars indicate per cent change from wild type. 
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ISI, interstimulus interval. f, Example image of dendrites in radiatum. 

g, Quantification of dendritic GFP-gephyrin puncta from NPAS4-WT (black, 
standard housing, n = 15 sections/63 dendritic segments; enriched environment, 
n= 14 sections/40 dendritic segments) and NPAS4-KO (red, standard housing, 
n= 18 sections/172 dendritic segments; enriched environment n = 18 sections/ 
167 dendritic segments) neurons. Enriched environment P< 0.01. h, Example 
image of somata in pyramidale. i, Quantification of somatic GFP-gephyrin puncta 
from NPAS4-WT (black, standard housing, n = 14 sections/32 soma; enriched 
environment, n = 11 sections/20 soma) and NPAS4-KO (red, standard housing, 
n = 17sections/94 soma, enriched environment: n = 21sections/75 soma) 
neurons. Enriched environment P < 0.05. All data are shown as mean + s.e.m. 


dendrites. In light of recent reports indicating that inhibition within 
these cellular domains can have distinct functions”****, we speculate 
that this redistribution of inhibition will have significant effects on 
information processing in the postsynaptic neuron. For example, the 
opposing regulation of somatic and apical dendritic inhibition may allow 
integration or plasticity of excitatory events in the apical dendrites while 
still limiting the generation of somatic action potentials and thus the 
propagation of information to downstream neurons. This is in contrast 
to homeostatic scaling of inhibitory synapses—a process whereby the 


Figure 4 | Behaviourally induced BDNF regulates somatic but not dendritic 
inhibition. a, Schematic of Bdnf gene. Exons 1-9 are indicated. b, qPCR of 
NPAS4 bound DNA. ChIP samples are from wild-type mice in standard 
conditions (black) or after kainic acid injection (red). us, upstream region; P1, 
promoter 1; [3, intron 3; neg 1, control region; P4, promoter 4; neg 2, control 
region. Enrichment relative to input DNA. n = 3. *P < 0.05. ¢, Induction of 
Bdnf exons in wild-type (black, standard housing, n = 5 animals; kainic acid, 
n = 6 animals) or Npas4 ‘animals (red, standard housing, n = 5 animals; 
kainic acid, n = 6 animals) mice as measured by qRT-PCR with exon specific 
primers, E1-9.*P < 0.05. Data in b and c are shown as mean = s.e.m. 

d-f, eIPSCs measured from mice maintained in standard housing. Top shows 
average eIPSC, normalized pairwise to the wild-type neuron, measured from 
BDNF-WT (black) and BDNF-KO (green) neurons in response to stimulation 
in radiatum (d), pyramidale (e) or oriens (f). Scale bars indicate per cent change 
from wild type. Bottom shows eIPSC amplitude measured from pairs of 
neighbouring BDNF-KO and BDNF-WT neurons in response to stimulation of 
axons in radiatum (d) (n = 14 pairs), pyramidale (e) (n = 15 pairs) or oriens 
(f) (n = 11 pairs). Open circles represent BDNF-KO/BDNF-WT pairs. Red 
circles indicate mean = s.e.m. g-i, eIPSC measured from mice housed in an 
enriched environment. Data are shown as in d-f. R, n = 10 pairs; P, n = 14 
pairs; P< 0.05; O, n = 14 pairs. 
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action potential output of a neuron is regulated by cell-wide scaling of 
synaptic strengths that preserves the relative strengths among synapses. 
The NPAS4-dependent redistribution of inhibitory circuitry may under- 
lie recent findings that associate NPAS4 function with contextual or 
stimulus dependent fear conditioning”””. 


METHODS SUMMARY 


Animals were handled according to protocols approved by the Harvard University 
Standing Committee on Animal Care and the University of California San Diego 
Institutional Animal Care and Use Committee, and were in accordance with 
federal guidelines. Full details of experimental procedures for animal handling, 
virus injections, acute and organotypic slice preparation, electrophysiology, 
immunostaining, Sholl analysis, synapse labelling and quantification, ChIP, 
qRT-PCR and western blots are provided in the Methods. 
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METHODS 


Animal husbandry and handling. Animals were handled according to protocols 
approved by the Harvard University Standing Committee on Animal Care and the 
University of California San Diego Institutional Animal Care and Use Committee, 
and were in accordance with federal guidelines. The following animal lines were 
used: Npas4"’ * NPAS4 knockout (Npas4 '~) mice’’, wild type (C57BL/6J, JAX 
000664), Bdnf‘ (Bdnf < tm3Jae>/J, JAX 004339) and EMX-Cre (B6.129S2- 
EMX1™ Ky JAX 005628). Both male and female mice were used. Elec- 
trophysiology, Sholl analysis and GFP-gephyrin experiments were performed 
on animals between postnatal day 21 and 28 (P21-P28). Immunostaining, ChIP 
and qRT-PCR experiments were done on samples collected from P25-P35 animals. 

For experiments in which mice were exposed to an enriched environment, three 
days after virus injection (P16-P18) animals (dam and pups) were moved to a 
larger cage that contained a running wheel, hut, tunnel and several other novel 
objects. To maximize novelty, the objects in the environment were rearranged and 
new objects introduced every second day. Mice were housed in the enriched 
environment for 4-10 days. Although we detect a limited number of NPAS4 positive 
excitatory neurons at a given time point, exploration of a novel environment 
increases NPAS4 immunoreactivity to an extent similar to that observed for other 
activity inducible genes such as Fos and Arc. Moreover, behaviourally triggered 
NPAS4 induction appears to occur iteratively, and in distinct subpopulations of 
neurons as the animal engages and explores novel features of the environment, 
such that the number of NPAS4 immunopositive neurons detected at any given 
time point significantly underestimates the number of neurons that express NPAS4 
over a period of several days. 

For experiments in which seizures were induced, kainic acid (2.5-10 mg per kg) 
was injected intraperitoneally. Mice were euthanized 2h after the first visible 
seizure for ChIP, at 3 hours for immunohistochemistry, at 6h for qRT-PCR or 
24h later for electrophysiology experiments. These time points were selected to 
allow sufficient time for expression of NPAS4 protein, the execution of an NPAS4- 
dependent program of gene expression and potential synaptic regulation but 
before detectable seizure-related cell death. 

Sterotaxically guided surgery. All surgeries were performed according to proto- 
cols approved by the Harvard University Standing Committee on Animal Care 
and the University of California San Diego Institutional Animal Care and Use 
Committee, and were in accordance with federal guidelines. Surgeries were per- 
formed on mice between P13 and P15. Animals were deeply anaesthetized by 
inhalation of isoflurane (initially 3-5% in O, maintained with 1-2%) and secured 
in the stereotaxic apparatus (Kopf). Animal temperature was maintained at 37 °C. 
The fur was shaved and scalp cleaned with betadine and 100% ethanol three times 
before an incision was made to expose the skull. A small hole was drilled through 
the skull and the CA1 region of hippocampus was specifically targeted (medial/ 
lateral: + 2.9 mm; anterior/posterior: —2.5 mm; dorsal/ventral: 2.8 mm below the 
dura) and virus was injected (250-300 nl; 150 nl min“ '). Five minutes post-injection, 
the needle was retracted, the scalp sutured and the mouse returned to its home 
cage. All animals were monitored for at least one hour post-surgery and at 12-h 
intervals for the next 5 days. Post-operatively, analgesic (flunixin, 2.5 mg per kg) 
was administered at 12-h intervals for 72h. 

Virus production. AAV-Cre-GFP was produced by the Harvard Gene Therapy 
Initiative or UNC Vector Core with a plasmid provided by M. During (Ohio State 
University). The plasmids containing the genome for AAV-EFla-YFP-2A-Cre, 
AAV-mCherry-IRES-Cre and AAV-FLEX-GFP-gephyrin were generated using 
standard molecular cloning techniques and were produced by the UNC Vector Core. 
Acute slice preparation. Transverse hippocampal slices were prepared from 
Npas4"", EMX-Cre; Npas4", Bdnf“' or C57BL/6 mice (P21-P28). Animals were 
anaesthetized by inhalation of isoflurane. The cerebral hemispheres were quickly 
removed and placed into ice-cold choline-based artificial cerebrospinal fluid 
(choline-ACSF) consisting of (in mM): 110 choline-Cl, 25 NaHCOs, 1.25 NasHPO,, 
2.5 KCl, 7 MgCl, 25 glucose, 0.5 CaCh, 11.6 ascorbic acid, 3.1 pyruvic acid and 
equilibrated with 95% O,/5% COs. Tissue was blocked and transferred to a slicing 
chamber containing choline-ACSF. Slices (300 jum) were cut with a LeicaVT1000 s 
vibratome (Leica Instruments) and transferred to a holding chamber containing 
ACSF consisting of (in mM): 127 NaCl, 25 NaHCOs, 1.25 Na,HPO,, 2.5 KCl, 
2 CaCh, 1 MgCh, 25 glucose, and saturated with 95% O,/5% CO. Slices were 
incubated at 30 °C for 30-40 min and then kept at room temperature for no more 
than 6h until recordings were performed. For mice injected with AAV-Cre-GFP, 
slices showing infection of between 20% to 40% in CA1, as determined by GFP 
fluorescence, were used for recording. Slices showing greater than ~40% infected 
neurons or infection outside of CA1 were discarded. 

Organotypic slice preparation and transfection. Hippocampi were rapidly dis- 
sected from wild-type mice at P7 in ice cold dissection media consisting of (in mM): 
1 CaCl, 5 MgCl, 10 glucose, 4 KCl, 26 NaHCOs, 218 sucrose, 1.3 NaH,PO,4"H,0, 
30 HEPES. Tissue was transferred to a tissue chopper (Mclliwan Ted Pella) and 


400-jum thick sections cut. Sections were transferred to tissue culture plates and 
grown on PTFE inserts (Milicell Organotypic insert, EMD Millipore) in slice culture 
media consisting of (in mM or percentage) 1X MEM, 20% horse serum, 1 
L-glutamine, 0.125% ascorbic acid, 1 CaCl, 2 MgCl, 12.8 glucose, 5.25 
NaHCOs, 30 HEPES, pH 7.4, 320 mOsm) for 7-10 days. On the second day in 
vitro, cultures were biolistically (Gene Gun, Biorad) co-transfected with 1-,1m gold 
particles coated with GFP and shRNAs targeting the gene of interest (particles 
prepared with GFP 15h1g, 3-9 ug of shRNA, pcDNA3 to a final DNA mass of 
50 pig as per the manufacturer’s instructions). The shRNA sequences are indexed 
in Supplementary Table 4. 

Electrophysiology. Whole-cell voltage clamp recordings were obtained from CA1 
pyramidal neurons visualized with infrared, differential interference contrast micro- 
scopy. Neurons were held at —70 mV except for experiments measuring evoked 
responses from stimulation of axons in stratum lacunosum, during which neurons 
were held at 0 mV. Patch pipettes (open pipette resistance 2-4 MQ) were filled 
with an internal solution consisting of (in mM) 147 CsCl, 5 Naz-phosphocreatine, 
10 HEPES, 2 MgATP, 0.3 Na,GTP and 2 EGTA. Osmolarity and pH were adjusted 
to 300 mOsm and 7.3 with double distilled water and CsOH, respectively. In 
experiments in which mIPSCs were recorded, currents were pharmacologically 
isolated with bath application of 0.5 UM tetrodotoxin citrate (Tocris Bioscience), 
10M (R)-CPP (Tocris Bioscience) and 10}1.M NBQX disodium salt (Tocris 
Bioscience). In experiments in which elPSCs were recorded, inhibitory currents 
were pharmacologically isolated with bath application of CPP and NBQX. Addi- 
tionally, 5 mM QX-314 (Sigma) was added to the internal solution. Extracellular 
stimulation of local axons within specific lamina of the hippocampus was delivered 
by current injection through a theta glass stimulating electrode that was placed in 
the centre of the relevant layer (along the somato-dendritic axis of the CA1 neu- 
ron) and within 100-200-1m laterally of the patched pair. The stimulus strength 
was the minimum required to generate an e[PSC in both NPAS4-KO and NPAS4- 
WT neurons. 

Data acquisition and analysis. Electrophysiology data were acquired using pClamp 
software, either a Multiclamp 700B or Axoclamp 200B amplifier, and digitized 
with a DigiData 1440 data acquisition board (Axon Instruments). Data were 
sampled at 10 kHz and filtered at 4 or 6 kHz except for experiments with stimu- 
lation of axons in lacunosum which were filtered at 1 kHz. Off-line data analysis 
was performed using custom software written in Igor Pro by B.L.B. (Wavemetrics). 

Experiments were discarded if the holding current was greater than —500 pA or 
if the series resistance was greater than 25 MQ. In experiments in which direct 
comparisons were made between two neurons, recordings were discarded if the 
series resistance differed by more than 25% between the two recordings. All 
recordings were performed at room temperature (~20-22 °C). 

Threshold for mIPSC detection was determined independently for each neuron 
and was based on the average root mean square (r.m.s.) of the first 150 ms of 
the recording. Amplitude threshold was set to 1.5 times r.m.s. for that cell and 
recordings were discarded if the r.m.s. was greater than 6 pA. This strategy resulted 
in fewer missed events and fewer erroneously called events than applying a single 
amplitude threshold. 

The amplitude of eIPSCs was calculated by averaging the amplitude 0.5 ms 

before to 2 ms after the peak of the current. Data are shown as positive values 
for clarity. Slopes of the rise times of the eIPSCs were measured by normalizing the 
eIPSC, then measuring the slope between 10-90% of the peak. Paired pulse ratios 
(PPR) were calculated by recording a template elPSC for each cell, subtracting the 
template wave from the first pulse, and then measuring the corrected amplitude of 
the second peak. 
Immunohistochemistry, Sholl analysis and GFP-gephyrin quantification. 
Animals were anaesthetized by inhalation of isoflurane. Hippocampi were rapidly 
dissected in ice-cold dissection media as described above and drop fixed in 4% 
paraformaldehyde in PBS at 4 °C for 1.5-3 h followed by overnight incubation in 
30% sucrose in PBS. Cryoprotected tissue was stored in Tissue-Tec O.C.T. at 
—80 °C and subsequently sectioned at a thickness of 40-45-pm (Leica CM1950 
cryostat) for immunostaining or GFP-gephyrin puncta quantification, or at a 
thickness of 100-j1m for Sholl analysis (Leica Instruments). 

For NPAS4 immunostaining, hippocampal sections were blocked in 2% goat 
serum and 0.2% Triton X-100 in PBS for 1h at room temperature. Sections were 
incubated in primary antibody overnight at 4 °C, washed three times in PBS, incu- 
bated in species-matched fluorescently conjugated secondary for 1h at room 
temperature and washed again in PBS. Finally, sections were mounted on slides 
with Fluoromount-G (SouthernBiotech). The following antibodies were used: 
rabbit anti- NPAS¢4 (1:1,000, made in house), mouse anti-NeuN (1:1,000, Millipore) 
and rabbit anti-GFP (1:500, Invitrogen). The secondary antibodies were Alexa-488 
or Alexa-555 against the appropriate species (1:500, Invitrogen). Sections were 
imaged on a Zeiss LSM5 Pascal confocal microscope with a X20 objective. In cases 
in which the tissue was too large, multiple images were taken and stitched together 
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post-hoc in Adobe Photoshop. Images for DAPI and NeuN quantification were 
obtained on an Olympus BX61VS microscope with a X20 objective and subse- 
quently process and exported into JPEG format with the manufacturer’s software. 
When necessary, nuclei were labelled by adding Hoechst 33582 dye (Invitrogen) to 
the final PBS (1:5,000). Nuclei positive for the relevant marker were counted and 
quantified using ImageJ. In Npas4 * mice that have been sparsely infected with 
AAV-Cre-GFP and then administered kainic acid, we see dendritic NPAS4 
immunoreactivity in the NPAS4-WT but not in the NPAS4-KO neurons. This 
indicates to us that the low levels of dendritic staining we observe with our NPAS4 
antibody is unlikely due to cross-reactivity with another protein. 

The neurons used for Sholl analysis were from animals injected with the AAV- 
EFla-YFP-2A-Cre. Tissue was fixed as described above. To maximize signal, GFP 
signal was amplified by staining with rabbit anti-GFP antibodies (1:1,000, Invitrogen). 
The secondary antibody was anti-rabbit Alexa-488 (1:500, Invitrogen). Neurons 
selected for analysis if there were few or no other infected neurons nearby to 
facilitate reconstruction of dendritic morphology. After imaging, the neurons were 
traced and skeletonized using the Neuron] plugin for ImageJ and Sholl intersec- 
tions were subsequently quantified and significance determined by ANOVA. 

The neurons used for GFP-gephyrin puncta quantification were from animals 
injected with AAV-mCherry-IRES-Cre and AAV-FLEX-GFP-gephyrin. GFP was 
not amplified by immunostaining. Images were acquired on a Leica SP5 confocal 
with resonant scanner through a X63 oil immersion objective. Fluorophores were 
excited using Argon 488 and HeNe 594 lasers and emitted photons detected through 
GFP or Alexa 594 filter cubes with Leica hybrid detectors (HyD3). Images were 
acquired at 16 bit resolution, 1024 X 1024 pixels and each frame the average of 
8 scans. Images were analysed with ImageJ. Maximum intensity projections were 
made of each channel. A Yen threshold was applied to the green channel and 
puncta identified using the particle counter. Our reported puncta densities are 
lower than expected by electron microscopy. This is probably due to detection 
limitations of our microscope, incomplete or faint labelling of synapses by GFP- 
gephyrin, and the thickness of our sections. 

Chromatin immunoprecipitation. Hippocampi from wild-type mice main- 
tained in standard housing or injected with kainic acid (2.5-10 mg per kg, 2-3h 
prior) were rapidly dissected in ice-cold dissection media (as above). The tissue 
was homogenized and DNA and protein were cross linked for 11 min (in mM: 10 
HEPES-NaOH pH 7.5, 100 NaCl., 1 EDTA. 1 EGTA, 1% formaldehyde, in PBS). 
Formaldehyde was quenched with 2 M glycine in PBS and the sample incubated 
onarocker at room temperature for 5 min. Tissue was pelleted (2,000 r.m.p., 5 min, 
4 °C), washed with PBS plus PMSF, and re-pelleted (2,000 r.p.m., 5 min, 4 °C). The 
supernatant was removed and the pellet resuspended in 5 ml L1 buffer (in mM: 50 
HEPES-NaOH 7.5, 140 NaCl, 1 EDTA, 1 EGTA, 0.25% Triton X-100, 0.5% NP40, 
10% glycerol, 1M BGP, 0.2M NaVOu,, 0.5M NaF, 1X complete protease inhibitor 
cocktail without EDTA (Roche)). The sample was the further homogenized by ten 
strokes with a tight pestle in a Dounce homogenizer, pelleted (2,000 r.p.m., 5 min, 
4 °C), washed in L1, pelleted and then resuspended in buffer L2 (in mM: 10 Tris- 
HCI pH 8.0, 100 NaCl, 1M BGP, 0.2 M NaVOg, 0.5 M NaF, 1X complete protease 
inhibitor cocktail without EDTA (Roche)). Samples were placed on a rotator at 
room temperature for 10 min, pelleted and resuspended in L3 buffer (in mM 10 
Tris-HCl pH8.0, 1 EDTA, 1 EGTA, 1M BGP, 0.2M NaVOg, 0.5M NaF, 1x 
complete protease inhibitor cocktail without EDTA (Roche)) to a final concentra- 
tion of 10 million nuclei per ml. Twenty million nuclei were sonicated using a 
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Misonix 3000 for a total time of 7 min, power 7.5 and at 4°C. NPAS4 immuno- 
precipitation was done using protein G Dyna Beads (Life Technologies) according 
to the manufacturer’s instructions using 4 1g of NPAS4 antibody'®. NPAS4-bound 
DNA was quantified using qPCR. A list of putative NPAS4 target genes was deter- 
mined by analysing ChIP and RNA-seq data published under Gene Expression 
Omnibus accession number GSE21161. 

qRT-PCR. qRT-PCR was on cDNA samples made from RNA icollected from 
wild-type of NPAS4-KO hippocampi from mice that been housed in standard 
conditions or injected with kainic acid. RT-PCR was carried out with SYBR 
green (Life Technologies) and primers with the following sequences for NPAS4 
ChIP samples probed at the Bdnflocus: us 5’-TGGTGAAAACACTTGGGCATA-3’; 
5'-TGATGAGCTGGGAACTCTGC-3’; P1 5'-GTCCGCTGGAGACCCTTAG 
T-3', 5’-CTGAGCCAGTTACGTGACCA-3’'; 13 5’-CTTCCCAGATGGTGCT 
GT-3', 5’-AATCTCCCAGTTCTGCGTTC-3’; P4 5’-CCCTGGAACGGAATTC 
TTCT-3’, 5’-TGCACGAATTACCAGAATCA-3’; Neg] 5’-CATTCAGCACCT 
TGGACAGA-3’, 5’-GCTTGACAGCGAGGAAAAGA-3'; Neg2 5'-GGCCTG 
AAGTTCAAGGATGG-3’',5’-GCCTGCCACTGAAGCTTGTA-3’. 

In cDNA samples probed for various Bdnf isoforms, Npas4, PCSK1, Adcyap1 
and Penk1, the primers used were: Bdnf exon1 5'-CACTGAGCAAAGCCGAAC 
TTCTC-3’, 5'-TCACCTGGTGGAACATTGTGGC-3’; Bdnf exon 2 5'-AGCG 
GTGTAGGCTGGAATAGACTC-3’, 5'-GGTGGAACTTCTTTGCGGCTTAC-3’; 
Bdnf exon 3 5'-TACCCCTTTCTATCATCCCTCCCCG-3’, 5'-GAAGCATCCG 
GCCCGACAGTTCCAC-3'; Bdnf exon 4 5'-CGCCATGCAATTTCCACTATC 
AATAATTTAAC-3’, 5’-CGCCTTCATGCAACCGAAGTATG-3’; Bdnf exon 5 
5'-CCATAACCCCGCACACTCTGTGTAG-3’, 5’-CTTCCCGCACCTTCACA 
GTTCCAC-3’; Bdnf exon 6 5'’-GATCCGAGAGCTTTGTGTGGAC-3’, 5'-GCC 
TTCATGCAACCGAAGTATG-3’; Bdnf exon 7 5'’-GGTCCAAGGTCAACGTT 
TA-3’, 5'-TAAACGTTGACCTTGGACC-3’, Bdnf exon8 5'-GAACAAACTGA 
TTGCTGAA-3’, 5'-TTCAGCAATCAGTTTGTTC-3’; Bdnf exon9/coding exon 
5'-GATGCCGCAAACATGTCTATGA-3’, 5'’-TAATACTGTCACACACGCTC 
AGCTC-3’; Npas4: 5'-AGGGTTTGCTGATGAGTTGC-3’, 5'-CCCCTCCACT 
TCCATCTTC-3’; PCSK1: 5'-TGGAGTTGCATATAATTCCAAAGTT-3’, 5'-AG 
CCTCAATGGCATCAGTTAC-3’; Adcyap1 5'-GAGAATCTGGGGGCAAGTC 
T-3', 5’-CACCAGCACCTGATCTGTCA-3’; Penk 5'-CCCAGGCGACATCAA 
TTT-3’, 5'-TCTCCCAGATTTTGAAAGAAGG-3’. 

Western blot analysis. Myc-tagged CDNA encoding NPAS4, BDNF, Pesk1, Adcyap1 
or PENK1 was transfected into 293T cells using Lipofectamine. Twenty-four hours 
later cells were lysed in 2X Laemmli buffer and boiled for 5 min. Lysates were 
resolved by SDS-PAGE and immunoblotted with antibodies targeting Myc 
(1:1,000, Abcam) or actin (1:2,500, Abcam). 

Statistics. All data are shown as the mean + s.e.m. unless otherwise noted. Data 
for which a specific P value is not indicated are not significant (P > 0.05). 

For electrophysiology experiments, significance was determined by paired two- 
tailed t-test for direct comparisons of neighbouring neurons, two-tailed t-test for 
comparisons between populations, one-way ANOVA (Dunnett’s test) for PPRs, 
and Wilcoxon for mIPSCs in the screen of putative NPAS4 target genes. All data 
sets were acquired from at least 3 animals, 2 slices from each animal. 

For imaging experiments, significance was determined by ANOVA or two- 
tailed t-test. All data sets were acquired from at least 3 animals, 3-5 images per 
animal. 

For qRT-PCR experiments, significance was determined by two tailed t-test. 
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Extended Data Figure 1 | Infection with AAV-Cre-GFP effectively excises immunostained NPAS4 (red) and NeuN (blue) protein. c, Quantification of 
Npas4. a, Schematic of the AAV-Cre-GFP virus genome. b, Representative neurons with overlapping GFP and NPAS4 in the sections represented in 
hippocampal section from an AAV-Cre-GEP; Npas4"" animal 3h after kainic _b. n = 14 neurons, *P <0.01. 

acid injection. Sections were imaged for native GFP fluorescence (green) and 
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Extended Data Figure 2 | NPAS4-dependent changes in inhibition require 
loss of NPAS4 in excitatory neurons. a, mIPSCs frequency (left) and 
amplitude (right) were measured from pairs of neighbouring GFP* and GFP~ 
neurons from wild-type (C57BL/6) mice maintained in standard housing. 
n= 17 pairs. b, mIPSCs frequency (left) and amplitude (right) were measured 
from pairs of neighbouring GEP* and GFP” neurons from wild-type (C57BL/ 
6) mice 24h after injection of kainic acid. n = 17 pairs. Open circles indicate 
individual GFP*/GFP~ pairs. Red circles indicate mean ~ s.e.m. 
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c, Representative hippocampal CA1 section from an EMX-Cre;Npas4* and 
EMX-Cre;Npas4"" mouse 3h after kainic acid injection. Sections were 
immunostained for NPAS4 (red) protein. d, Quantification of NPAS4 
immunoreactivity. n = 9 sections from 3 animals. *P < 0.05. e, mIPSCs 
frequency and amplitude measured from neurons in EMX-Cre;Npas4"" mice 
maintained in standard housing (n = 14 neurons) or 24h after kainic acid 
injection (n = 16 neurons). freq: P< 0.01; amp: P< 0.05. 
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Extended Data Figure 3 | Stimulation of axons in different hippocampal 
layers generates eIPSCs with distinct rise times. a, Cartoon of a CA1 
pyramidal neuron and example eIPSCs generated in response to stimulation of 
axons in lacunosum (L, blue), radiatum (R, yellow), pyramidale (P, red) and 
oriens (O, green). Slopes (10-90% of the peak eIPSC) are indicated by coloured 
lines. b, Standard housing summary of the slopes of the eIPSC rise times 
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recorded from neighbouring NPAS4-WT and NPAS4-KO neurons in response 
to stimulation of axons in each of the four hippocampal layers. Measurements 
are from eIPSCs in Fig. 2b-e. c, Enriched environment summary of the slopes as 
in b. Measurements are from eIPSCs in Fig. 2f-i. d, Kainic acid summary of the 
slopes as in (b). Measurements are from eIPSCs in Extended Data Fig. 4b-e. 
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Extended Data Figure 4 | NPAS4 differentially regulates inhibitory 
synapses across the somato-dendritic axis of pyramidal neurons in response 
to kainic acid. a, Experimental configuration. L, lacunosum; R, radiatum; P, 
pyramidale; O, oriens. b-e, e[PSCs measured from mice injected with kainic 
acid. Top shows average eIPSC, normalized pairwise to the wild-type neuron, 
measured from NPAS4-WT (black) and NPAS4-KO (green) neurons in 
response to stimulation in lacunosum (b), radiatum (c), pyramidale (d) or 


oriens (e). Scale bars indicate per cent change from wild type. Bottom shows 
eIPSC amplitude measured from pairs of neighbouring NPAS4-KO and 
NPAS4-WT neurons in response to stimulation of axons in lacunosum 

(b) (n = 13 pairs); radiatum (c) (n = 17 pairs, P< 0.05); pyramidale 

(d) (n = 18 pairs, P< 0.01); or oriens (e) (n = 15 pairs, P< 0.05). Open circles 
represent NPAS4-KO/NPAS4-WT pairs. Red circles indicate mean + s.e.m. 
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Extended Data Figure 5 | Sholl analysis and paired pulse ratios for NPAS4- KO (red, 7 = 13 neurons) dendrites (right). c-e, PPRs measured from NPAS4- 
WT and NPAS4-KO neurons from mice injected with kainic acid. WT and NPAS4-KO neurons in response to stimulation in radiatum 

a, Schematic of the AAV-YFP-2A-Cre virus genome used for Sholl analysis. (c), pyramidale (d) or oriens (e). Standard housing (black) or kainic acid (red). 
b, Kainic acid example NPAS4-WT and NPAS4-KO CAI pyramidal neurons _—_ Standard housing data are re-plotted from Fig. 3c-e. R, n = 11 neurons; 

(left) and quantification of NPAS4-WT (black, n = 10 neurons) and NPAS4- _P, m = 13 neurons; O, n = 8 neurons. All data are shown as mean + s.e.m. 
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Extended Data Figure 6 | Functional screen of putative NPAS4 target genes shown. Knockdown of genes that result in fewer or smaller mIPSCs are shown 
identifies many genes that regulate mIPSCs frequency or amplitude. on top (mustard); knockdown of genes that result in more or larger mIPSCs 
a, b, mIPSCs were recorded from CA1 pyramidal neurons transfected with a _—_ are shown on the bottom (blue). The dashed line represents the median value 
pool of three shRNAs targeting a single putative NPAS4 target gene. mIPSC _ from control conditions. P < 0.05. 

frequency (a) and amplitude (b) of those significantly different from control are 
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Extended Data Figure 7 | Validation of five putative NPAS4 target genes. 
a-e, Select putative NPAS4 target genes that influence mIPSC frequency and/or 
amplitude were subjected to additional validation: Npas4 (a), Bdnf (b), Pcsk1 
(c), Penk1 (d) and Adycap1 (e). Top shows mIPSC frequency and amplitude 
recorded from CAI pyramidal neurons that were transfected with individual 
shRNAs from the pool used in the primary screen. The mean + s.e.m. mIPSC 
frequency and amplitude recorded from control neurons is indicated by the 


grey bar. Expression of individual shRNAs phenocopied expression of the pool. 
Bottom shows expression of target-specific shRNAs reduces expression of the 
target protein. f, RT-PCR using primers specific to the target gene indicate 
that all five genes are induced in the hippocampus and their induction is 
misregulated in NPAS4 knockout neurons. Standard housing, n = 5 animals; 
kainic acid, n = 6 animals. All data are shown as mean + s.e.m. *P < 0.05. 
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Extended Data Figure 8 | Many putative NPAS4 target genes do not putative NPAS4 target gene. mIPSC frequency (a) and amplitude (b) of those 
regulate inhibitory synapses. a, b, mIPSCs were recorded from CA1 not significantly different from control are shown. 
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Extended Data Figure 9 | BDNF regulates somatic inhibitory synapses in 
response to kainic acid. a—c, e[PSCs were measured simultaneously from 
neighbouring BDNF4-WT (black) and BDNF-KO (green) neurons from mice 
injected with kainic acid. Top shows average eIPSC, normalized pairwise to the 
wild-type neuron, measured from BDNF-WT (black) and BDNF-KO (green) 
neurons in response to stimulation of axons in radiatum (a), pyramidale (b) or 


oriens (c). Scale bars indicate per cent change from wild type. Bottom shows 
eIPSC amplitude measured from pairs of neighbouring BDNF-KO and BDNF- 
WT neurons in response to stimulation of axons in radiatum (a) (n = 14 pairs), 
pyramidale (b) (m = 12 pairs, P< 0.01), or oriens (c) (n = 15 pairs). Open 
circles represent BDNF-KO/BDNF-WT pairs. Red circles indicate 

mean + s.e.m. 
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Extended Data Table 1 | Putative NPAS4 target genes 


gene chr pos induction of gene induction at peak gene chr pos induction of gene induction at peak 
4921504EO6Rik 2 19476120 8 6.2 Maml3 3 51906195 3.28369 575 
9530008L14Rik 13 41896890 8.6 21 Maml3 3 51907390 3.28369 113333 
Abcd2 15 90983260 2.48429 55 Mest 6 30695130 4.66716 6.274 
Adcyap1 17 +93602815 4.33656 5.929 Myo3b 2 69952395 2.02105 =| 
Ankrd56 5 93483350 15.9545 11 Nfil3 13 53078245 4.87197 T2S2 
Arhgap29 3 121676880 3.56853 5.429 Npas4 19 4989805 73,8489 58.538 
Bd nf 2 109532475 35:291S 205.333 Npas4 19 4993095 73.8489 19 
Bdnf 2 109518455 35.2919 4.333 Nptx2 5 145308275 7.35033 9.568 
Bd nf 2 109514675 35.2919 128 Nr4al 15 101097330 36.0876 66.4 
Cd6 19 10859575 2.11111 Bas | Nr4a2 2 56969220 6.71864 10.25 
Cpa4 6 30522930 5.70588 25:5 Nr4a3. 4 48057945 12.8444 10.559 
Cpxm2 7 139350455 2.56693 6.286 Nr4a3_ 4 48056410 12.8444 11.375 
Crem 18 3281155 5.14777 16.8 Nr4a3. 4 48057290 12.8444 12.034 
Crem 18 3281830 5.14777 22 Olfml2a 2 38806110 2.34211 LF 
Cyr61 3 145314740 4.5102 6 Pacsin3 2 91095860 2.12644 13 
D16Ertd472e 16 78576220 5.54462 12 Pam 1 99928620 3.52883 11.647 
D3Bwg0562e 3 117060470 4.03298 10 Pappa 4 64788065 2.88618 8 
Dbc1 4 68615695 4.20687 5.005 Pedhilx 20 117723835 2.41197 9 
Ddx3y 21 622975.5 3.93276 10 Peskl 13 75228705 37.6 13.4. 
Dkk2 3 131749835 9.25532 10.5 Pde4d 13 110612880 2.31523 50 
Dusp6 10 98786170 2.54083 9 Penkl 4 4065825 Bal 5.231 
E430004NO4Rik 10 28532080 8.13043 13 Perl 11 68913080 3.159 7.8 
Fer113 19 38066355 5.39535 74 Rabggtb 3 153574995 3.08718 6.065 
Fer113 19 37982665 5.39535 9 Rgs2 1 145852135 8.16283 10.181 
Fos 12 86812870 33,5814 14 Rgs2 1 145851380 8.16283 7.207 
Fosb 7 19905495 53 5.333 Rgs2 1 145850695 8.16283 8.76 
Fosb 7 19894115 53 20 Rspo3 10 29259485 2.27746 6 
Fosl2 5 32435940 5.29613 14 Scnla 2 66246895 2.98474 6.875 
Fosl2 5 32438065 5.29613 5.875 Sertadl 7 28271930 9.30631 7.526 
Gabra4 5 72008145 3.34333 5.5 Slc6al7 3 107319165 2.54003 5 
Gabrb1 5 72460260 2.17759 6 Snx7 -3.-—s- 117526220 2.07258 55 
Gent2 13 41013390 3.71429 6.2 Snx7 S ‘I17529915 2.07258 6.5 
Gpr22 12 32394835 5.68515 7.676 Sorcs3 19 48315650 2.76436 ai 
Gpr3 4 132767740 10.1849 15 Sorcs3 19 48659720 2.76436 3 
Hen1 13 118394055 2.98126 5:5 Sores3 19 48172075 2.76436 8 
Hdac9 12 35213780 8.31166 9.947 Spry2. 14 106297475 2.01437 14.75 
Hdac9 12 35214265 8.31166 S975 Stac 9 111591890 3.65815 5.083 
Hdac9 12 35182840 8.31166 5 sult2b1 13 96731800 4.05891 q 
Hdac9 12 35166755 8.31166 24.75 Sv2c 3 93251570 2 7 
Hdac9 12 35183850 8.31166 18 Tchh 8 66685070 6.68364 1379 
Hdac9 12 35173085 8.31166 5.31 TIL 9 53914130 2.51701 BY bo 
Hspb3 13 114453860 11.2703 10.947 Tnfaip8i3 1 162079810 11.2174 13 
Hunk 16 90387145 2.93033 5 Tnn 9 8544010 8.38462 17 
Iqsec3 6 121423910 2.03163 7.009 Trpc6 6 30871095 5.66667 6.75 
Junb 8 87502555 18.6986 19.684 Tsgal3 6 30871525 5.66667 6 
Kena4 2 107257495 3.2902 8.857 Tsgal3 6 30858185 5.66667 12 
Kenj3 2 55289600 5.1252 6.675 Tsgal3 8 74194390 2.24115 5.667 
Klhl4 20 111587385 6.72727 5.889 Uncl3a 5 137505265 13.7907 10.625 
Maff 15 79177910 10.8679 14 Vet 5 137506375 13.7907 14.077 
Maff 15 79178470 10.8679 6.5 Vet 9 119920665 13.7907 6.8 


Putative NPAS4 target genes were identified by analysing previously published NPAS4 ChIP-seq and RNA-seq data (accession number GSE21161)"*. Inducible genes were selected as putative NPAS4 targets if 
(1) by ChIP-seq NPAS4 is shown to be bound within 10 kb of the putative target gene and (2) by RNA-seq the mRNA transcribed near the NPAS4 peak is induced greater than fivefold. 
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Integrin-modulating therapy prevents fibrosis and 
autoimmunity in mouse models of scleroderma 


Elizabeth E. Gerber', Elena M. Gallo!, Stefani C. Fontana’, Elaine C. Davis’, Fredrick M. Wigley’, David L. Huso* & Harry C. Dietz)° 


In systemic sclerosis (SSc), a common and aetiologically myster- 
ious form of scleroderma (defined as pathological fibrosis of the 
skin), previously healthy adults acquire fibrosis of the skin and 
viscera in association with autoantibodies’. Familial recurrence is 
extremely rare and causal genes have not been identified. Although 
the onset of fibrosis in SSc typically correlates with the production 
of autoantibodies, whether they contribute to disease pathogenesis 
or simply serve as a marker of disease remains controversial and the 
mechanism for their induction is largely unknown’. The study of 
SSc is hindered by a lack of animal models that recapitulate the 
aetiology of this complex disease. To gain a foothold in the patho- 
genesis of pathological skin fibrosis, we studied stiff skin syndrome 
(SSS), a rare but tractable Mendelian disorder leading to childhood 
onset of diffuse skin fibrosis with autosomal dominant inheritance 
and complete penetrance. We showed previously that SSS is caused 
by heterozygous missense mutations in the gene (FBN1) encoding 
fibrillin-1, the main constituent of extracellular microfibrils*. SSS 
mutations all localize to the only domain in fibrillin-1 that har- 
bours an Arg-Gly-Asp (RGD) motif needed to mediate cell-matrix 
interactions by binding to cell-surface integrins’. Here we show 
that mouse lines harbouring analogous amino acid substitutions 
in fibrillin-1 recapitulate aggressive skin fibrosis that is prevented 
by integrin-modulating therapies and reversed by antagonism of 
the pro-fibrotic cytokine transforming growth factor B (TGF-B). 
Mutant mice show skin infiltration of pro-inflammatory immune 
cells including plasmacytoid dendritic cells, T helper cells and plasma 
cells, and also autoantibody production; these findings are normal- 
ized by integrin-modulating therapies or TGF-B antagonism. These 
results show that alterations in cell-matrix interactions are suf- 
ficient to initiate and sustain inflammatory and pro-fibrotic pro- 
grammes and highlight new therapeutic strategies. 

Fibrillin-1 contributes to the regulation of TGF-B, a cytokine that 
has been descriptively linked to many fibrotic diseases, including both 
SSS and SSc**. TGF-B is secreted from the cell in the context of a large 
latent complex (LLC) that includes the active cytokine bound to a 
dimer of its processed amino-terminal propeptide, latency-associated 
peptide (LAP), which in turn binds to latent TGF-}-binding proteins 
(LTBPs)°. Studies in mouse models and in vitro have shown that 
fibrillin-1 interacts directly with LTBPs, allowing sequestration of 
the LLC by microfibrils’. 

Mutations throughout the FBN1 gene also cause Marfan syndrome 
(MEFS), a disorder characterized by bone overgrowth, ocular lens dis- 
location and aortic dilatation’. Failed matrix sequestration of the LLC 
in fibrillin-1-deficient patients and mice promotes increased activation 
of, and signalling by, TGF-f. SSS mutations are specifically localized to 
the fourth transforming growth factor-B-binding protein-like domain 
(TB4) of fibrillin-1, which encodes the RGD motif, through which 
fibrillin-1 binds integrins ,B3, «5B, and %,B. (refs 3, 5). 

To determine whether failed interaction between integrins and fibrillin-1 
is sufficient to initiate skin fibrosis, two Fbn1-targeted knock-in mouse 


models were generated: one with SSS-associated change W1572C (the 
mouse equivalent of human W1570C) and the other with an RGD to 
RGE substitution (D1545E) predicted to cause an obligate loss of inte- 
grin binding to fibrillin-1 (Supplementary Fig. 1). Mice heterozygous 
for either mutation phenocopy SSS with increased deposition of col- 
lagen by 1 month of age and a decrease in subcutaneous fat by 3 months 
of age (Fig. 1 and Supplementary Fig. 2a, b). Whereas homozygosity for 
D1545E causes embryonic lethality before embryonic day 10.5, mice 
homozygous for W1572C are viable and show accelerated skin fibrosis 
when compared with heterozygous littermates (Fig. 1 and Supplemen- 
tary Fig. 2a, b). As seen in patients with SSS or SSc?, mutant mice show 
disorganized and excessive microfibrillar aggregates in the dermis, 
with sparsely distributed elastin (Supplementary Fig. 2c). Freshly iso- 
lated cells from mutant dermis show increased surface levels of integ- 
rins &;$, and «,f; in its active conformation (as assessed using WOW- 
1 antibody) by flow cytometry (Fig. 2a). There was no corresponding 
increase in either total B; integrin or integrin B;, a subtype that can 
cross-react with WOW-1 (Supplementary Fig. 3). On the basis of these 
data, we speculated that disrupted cell—-matrix interaction in SSS might 
result in compensatory upregulation of specific integrins at the surface 
of dermal cells, and that integrins might represent a possible thera- 
peutic target for this disease. 

We next investigated whether mimicking integrin—matrix ligand 
(that is, fibrillin-1) interactions in mutant mice by using a , integrin- 
activating antibody (B,aAb; 9EG7) offered therapeutic potential for 
the treatment of SSS. Treatment of SSS mouse models with B,aAb for 
12 weeks normalized integrin expression, skin stiffness and distensi- 
bility and skin architecture (Fig. 2a—c and Supplementary Fig. 4). In 
keeping with a pathogenic role for B3 integrin, we found that targeted 
introduction of haploinsufficiency or the complete null state for the B; 
integrin gene (Itgb3) in SSS mice normalized skin stiffness, collagen 
deposition and subcutaneous fat by 3 months of age (Supplementary 
Fig. 5a,b). By 5 months of age, 8 of 67 (12%) Itgb3-targeted animals 
developed focal dermal and epidermal thickening (irrespective of Fbn1 
genotype) reminiscent of the aberrant wound-healing described prev- 
iously in B; integrin-deficient mice (Supplementary Fig. 5c)’. 

To assess for a pathogenic contribution for TGF-B, SSS mice were 
treated for 12 weeks with a panspecific TGF-f-neutralizing antibody 
(TGF-BNAb; 1D11) or isotype-matched control IgG after establish- 
ment of dense fibrosis at 12 weeks of age. Clinical (Fig. 3a) and histo- 
logical (Fig. 3b) findings confirmed full reversal of skin stiffness and 
restoration of skin architecture in TGF-BNAb-treated animals. 
Potential mechanisms for enhanced TGF-B activity include excessive 
concentration of latent TGF-B by the abnormally abundant microfi- 
brillar aggregates in the dermis, or excessive integrin-mediated activation 
(release) of TGF-B from its latent complex*. To discriminate between 
these possibilities, we used flow cytometry to monitor mutant mice for 
increased cell-surface expression of the three integrin subtypes (%,B5, 
oyB.6 and o,8s) known to support potent TGF-f activation’; this was 
not observed (Supplementary Fig. 6a). In addition, immunofluorescence 
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Baltimore, Maryland 21205, USA. 5Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, USA. 
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Figure 1 | SSS mouse models show skin fibrosis. a, Masson’s trichrome 
staining of back skin sections from male mice (genotypes indicated) at 1 month 
of age (top panels) and 3 months of age (bottom panels) shows progressive 
loss of subcutaneous fat and an expanded zone of dense dermal collagen in 
mutant animals. b, Quantification of the thickness of the zones of dermal 
collagen and subcutaneous fat in wild-type and mutant mice at 1 month of age 
(top panels) and 3 months of age (bottom panels) is shown. Similar findings 
were observed in mutant female mice (Supplementary Fig. 2a, b). DE, D1545E; 
WC, W1572C. For 1-month males, n = 9 (+/+), 10 (WC/+), 10 (WC/WC) 
and 9 (DE/+); for 3-month males: n = 13 (+/+), 9 (WC/+), 9 (WC/WC) and 
9 (DE/+). Scale bars, 50 um. +P < 0.001; {P < 0.0001. The upper and lower 
margins of each box define the 75th and 25th centiles, respectively; the internal 
line defines the median, and the whiskers define the range. Values outside the 
interquartile range are shown as open circles. 


analysis of skin in mutant mice did not reveal increased expression of 
free TGF-B1 (Supplementary Fig. 6b), which is known to be activated 
by integrins through interaction with the RGD sequence in its LAP 
(LAP1). There was an increase in total (free and active) TGF-B2 
(Supplementary Fig. 6b), which has not been shown to be activated 
by integrins (presumably as a result of the absence of an RGD sequence 
in LAP2)°. Furthermore, there was excessive concentration of both LAP1 
and LAP2 in the dermis of mouse models of SSS, suggesting accumula- 
tion of the LLC for TGF-B1 and TGF-£2, respectively. Although we 
cannot exclude a contribution of integrin-mediated TGF-f activation, 
these data suggest that enhanced TGF-f bioavailability contributes 
prominently to increased TGF-f activity in mutant mice. 

As seen in SSc, SSS mouse models show circulating anti-nuclear and 
anti-topoisomeraseI antibodies (Fig. 4a and Supplementary Fig. 7). 
The finding that the deep dermal fibrosis seen in early SSS (Fig. 1) co- 
localizes with high expression of active B3 integrin and accumulation 
of CD45* marrow-derived cells (Supplementary Fig. 8a) prompted 
speculation that an infiltrating class of immune cells might contribute to 
disease progression. In keeping with this hypothesis, nearly all dermal cells 
expressing high levels of «;8, and active B; integrins in SSS mice are CD317~ 
plasmacytoid dendritic cells (pDCs) (Fig. 4b). SSS mice show enrich- 
ment for cells that are CD11b CD3 CD19 B220* SiglecH * Ly6C™®" 
in the dermis, further validating this identity (Fig. 4b and Supplementary 
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Fig. 8b-d)’. As is characteristic of mature and active pDCs, these 
dermal cells express the pro-inflammatory cytokines interleukin (IL)-6 
and interferon (IFN)-« (Fig. 4c and Supplementary Fig. 8e)°. There is 
also dermal polarization towards pro-inflammatory T helper (Ty) cell 
populations, including CD4*IL-4* Tr2, CD4*IL-17* Tyl7 and 
CD4"IL-9* Ty,9 cells (Supplementary Fig. 9a, b). In keeping with skew- 
ing towards Ty2, Ty9 and/or T};17 cells, there was also an increased 
expression of IL-9, IL-13 and IL-22 by CD3* dermal cells (Sup- 
plementary Fig. 9a-c). There was no corresponding increase in either 
IFN-y*CD4* Tyl or FoxP3*CD4* T-regulatory (Treg) cells in 
mutant animals (Supplementary Fig. 9d). Last, the dermis of SSS mice 
also showed infiltration with B220°®*CD19~ activated B cells and 
CD138*B220'°”CD19* plasma cells (Supplementary Fig. 10). These 
abnormalities, including circulating autoantibodies and immune cell 
infiltration and activation, were normalized by treatment of mutant 
mice with B,aAb (Fig. 4 and Supplementary Figs 9 and 10). A similar 
response was seen in association with reversal of skin fibrosis on treat- 
ment with TGF-BNAb (Supplementary Fig. 11). 

We speculated that altered presentation of the fibrillin-1 RGD 
sequence might directly influence the expression of integrins by, and 
the performance of, pDCs. In keeping with this hypothesis, we found 
that wild-type spleen-derived pre-pDCs showed increased adherence 
and activation (expression of IFN-« and IL-6) when plated on the 
matrix expressed by SSS murine embryonic fibroblasts (MEFs) in 
comparison with control MEFs (Supplementary Fig. 12). 

SSc fibroblasts showed increased cell-surface presentation of total B, 
integrin (Supplementary Fig. 13) and active B; integrin (as monitored 
by WOW- 1 staining) in comparison with controls, whereas levels of total 
B; and B; integrins were normal (Supplementary Fig. 14a). Treatment 
with B;aAb TS2/16, which promotes and stabilizes integrin B-ligand 
interactions, normalized cell-surface levels of active B; integrin (Sup- 
plementary Fig. 14a). Treatment with B, integrin-blocking antibody 
(B3bAb) did not significantly decrease the cell-surface presentation of 
total B, integrin (Supplementary Fig. 13). Human SSc cells in culture 
showed decreased levels of microRNA-29 (miR-29) (Supplementary 
Fig. 14b), a small regulatory RNA that is repressed by TGF-B and is 
known to inhibit the expression of multiple matrix elements and to 
suppress fibrosis in selected disease states'®”’. Treatment with B,aAb 
normalized miR-29 expression and dose-dependently attenuated the 
expression of typeI and type III collagen in SSc fibroblasts (Sup- 
plementary Fig. 14b). SD208, an antagonist of the kinase activity of 
the type I TGF-B receptor subunit, also normalized the expression of 
collagen and miR-29a (Supplementary Fig. 14c). 

In addition to canonical (Smad-dependent) signalling, TGF-B can 
also initiate so-called non-canonical cascades, prominently including 
extracellular signal-regulated kinase (ERK1/2)°. SSc fibroblasts showed 
normal TGF-1-dependent phosphorylation of Smad3 (pSmad3) that 
was not influenced by integrin-modulating therapies, but uniquely showed 
TGF-f1-dependent phosphorylation of ERK1/2 (pERK1/2) in con- 
trast with control fibroblasts that was normalized on treatment with 
either B,aAb or B3bAb (Supplementary Fig. 14d, e). The activation of 
ERK1/2 in SSc fibroblasts was seen within 5 min of stimulation with 
TGF-B1 and was inhibited by pretreatment with SD208, suggesting a 
relatively direct response (Supplementary Fig. 14d, e). In keeping with 
a pathogenic contribution of pERK1/2, treatment of SSc fibroblasts 
with U0126, an inhibitor of the mitogen-activated protein kinase/ 
ERK kinase (MEK), increased miR-29a levels and decreased collagen 
expression in SSc fibroblasts (Supplementary Fig. 14f). Both SSS mouse 
models showed excessive activation of ERK1/2 in CD317* pDCs and 
other dermal cells (Supplementary Fig. 14g). Treatment of Fbn1?°F/* 
mice with the MEK inhibitor RDEA119 prevented skin stiffness, the 
accumulation of dermal collagen and the loss of subcutaneous fat 
(Supplementary Fig. 14h, i). 

This study shows that point mutations specifically in the sole integrin- 
binding domain of fibrillin-1 are sufficient to recapitulate the SSS pheno- 
type in mice and to initiate many findings reminiscent of SSc, including 
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Figure 2 | Integrin-modulating interventions prevent skin fibrosis. a, Flow 
cytometry of cells derived from the dermis reveals a unique population 
expressing both «58, and active B; integrins (monitored using WOW-1 
antibody) in mutant mice that is eliminated on treatment with B,aAb but not an 
isotype-matched control (IgG). Representative contour plots (left) and 
quantification (right) are shown. An agonist and antagonist of B; integrin 
activation were used to attest to the specificity of the WOW-1 antibody 
(Supplementary F ig. 16b). For isotype control-treated animals, n = 5 (Fbn1*!*) 
and 7 (Fbn1?!°*"'*); for B,aAb-treated animals, n = 4 (Fbn1*!*) and 7 
(Fbn1?'>45E/*). b, Clinical assessment demonstrated that B,aAb prevented 
skin stiffness in mutant animals when compared with those treated with an 


dermal fibrosis, autoantibody production, high IFN-« expression, T}42 
and Ty17 polarization, and accumulation of activated Bcells and 
plasma cells in the skin’**!*"*, Although previous studies have 
reported autoantibodies and subdermal fibrosis in tight skin (Tsk) 
mice harbouring a large central duplication in Fbn1, there are no direct 
human correlates, and both the mechanism and pathogenic relevance 
remain unclear’*”’. In SSS, all of these processes can be functionally 
linked to altered integrin expression and/or function because they are 
prevented by integrin-modulating therapies. Although skin fibrosis 
was observed in mice on conditional silencing of B; integrin expression 
in keratinocytes’®, targeting of Itgb1 in fibroblasts afforded relative 
protection against bleomycin-induced skin fibrosis'’. This apparent 
discrepancy has not been explained mechanistically. 

A comparison of MFS and SSS highlights the complicated role of the 
extracellular matrix in cytokine regulation. Unlike MFS, in which a 
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Figure 3 | A panspecific TGF-f-neutralizing antibody reverses established 
skin fibrosis. a, Clinical assessment showing that stiffness was fully normalized 
by treatment with TGF-BNAb, starting at 3 months of age and continuing 
for 12 weeks. b, Histological and morphometric analyses using Masson’s 
trichrome stain. For isotype control-treated animals, n = 14 (Fbn1 are ry. 

9 (Fbn1?'*F/*) and 8 (Fbn1 1°"); for TGF-BNAb-treated animals, 


128 | NATURE | VOL 503 | 7 NOVEMBER 2013 


Dermal GE (um) 


Font? Fn101545E/> Fp 1Wis72C/+ Font+ Fn 191545E/+ Fp 1W1572C/+ 


isotype-matched control (IgG). c, Masson’s trichrome staining reveals 
decreased skin collagen and preservation of subcutaneous fat in B;aAb-treated 
mutants (left). Quantification of the thickness of the zones of dermal collagen 
and subcutaneous fat is shown (right). For isotype control-treated animals, 
n= 12 (Fbn1*'*), 9 (Fbn1?5"'*) and 8 (Fbn1'°”*'*); for BaAb-treated 
animals, n = 12 (Fbn1*!*), 10 (Fbn1?'°*¥'*) and 10 (Fbn1“°?“'*), Scale 
bars, 50 um. *P < 0.05; **P < 0.01; tP < 0.001; {P < 0.0001. The upper and 
lower margins of each box define the 75th and 25th centiles, respectively; the 
internal line defines the median, and the whiskers define the range. Values 
outside the interquartile range are shown as open circles. 


deficiency of fibrillin-1 is seen, SSS mutations promote the increased 
deposition of abnormal microfibrillar aggregates that fail to make 
contact with neighbouring cells but retain the ability to bind to the 
TGF-B LLC, findings also seen in SSc’. This results in a decreased or 
increased concentration of latent TGF-B in tissues in MFS or SSS, 
respectively*. It is posited that in MFS a decreased concentration of 
LLC is offset by increased TGF-f activation but that this may occur ina 
tissue-specific manner®®. The relative deficiency of microfibrils and 
hence latent TGF-B in MFS would mandate continuing TGF-f pro- 
duction to support high signalling, whereas the high dermal concen- 
tration of TGF-B in SSS might allow a more sustained enhanced 
signalling state. Curiously, this does not seem to occur in all tissues 
in which fibrillin-1 is expressed, perhaps as a result of different reper- 
toires of expressed integrin subtypes that vary in their sensitivity to 
conformational changes induced by SSS mutations, and/or tissue-specific 
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Figure 4 | Immunological abnormalities in SSS mice are prevented by 
integrin-modulating therapies. a, Increased circulating levels of anti-nuclear 
and anti-topoisomerase I antibodies by enzyme-linked immunosorbent assay 
in Fbn1?'°"/* mice at 3 months of age are normalized on treatment with 
B,aAb but not when treated with an isotype-matched control (IgG). For isotype 
control-treated animals, n = 6 (Fbn17'*), 4 (Fbon1?54!*). for B,aAb-treated 
animals, n = 4 (Fbn1*/~), 10 (Fbn1?!**°"'*), b, The cells expressing high o5B, 
integrin in the dermis of mutant mice are CD317*" cells that fail to accumulate 
on treatment with §,aAb but not with an isotype-matched control (IgG). 


differences in the regulation of microfibrillar assembly. The stiffened 
ECM in SSS could support mechanical traction-based activation of the 
excessive amounts of latent TGF-B in the dermis, a plausible feedfor- 
ward mechanism for the observed fibrosis*. Thus the level of TGF-B 
signalling in a given tissue may, at least in part, be determined by the 
integration of both positive and negative regulation by microfibrils*®. 

Although the cell type that first detects and responds to aberrant 
presentation of the RGD sequence in fibrillin-1 remains unknown, it is 
useful to speculate on the involvement of pre-pDCs that normally 
perform a surveillance function for viral pathogens at low concentra- 
tions in the skin. Previous work has shown that «58, integrin influences 
the adhesion, migration and maturation of DCs and that migration is 
inhibited by B,aAb, at least in part through podosome disassembly”. 
Furthermore, a specific role for «58, integrin in pre-pDC chemotaxis 
and trafficking has been demonstrated”. It is therefore evident that 
pre-pDCs are informed by and respond to their matrix environment, 
with fibrillin-1 potentially serving as a prominent informant. In keep- 
ing with this, our in vitro observations (Supplementary Fig. 12) suggest 
that an altered matrix environment, devoid of any systemic influence, 
is sufficient to promote pDC recruitment and activation. It remains to 
be determined whether this relates to the loss of a physiological inhib- 
itory signal by normal microfibrils or to a pathogenic gain of function 
by abnormal microfibrillar aggregates seen in SSS and SSc. 

pDCs are a major source of IFN-& and are capable of inducing Ty2 
and T};17 skewing, autoreactive B-cell and plasma-cell differentiation, 
and autoantibody production (Supplementary Fig. 15)”'??°”; they 
have also previously been implicated in multiple autoimmune processes 
(including SSc)*'?°-**, Although pDCs can contribute to both tolero- 
genic T,.¢ or auto-inflammatory Ty17 cell commitment, in vitro 
experiments suggest that TGF-fB-treated pDCs favour the latter 
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and IL-6 (c). In b and ¢, for isotype control-treated animals, n = 5 (Fbn1 tet) 
and 7 (Fbn1?>*/*), for B,aAb-treated animals, n = 4 (Fbn1*'*) and 

7 (Fon1P PF), *P < 0,05; **P < 0.01; tP < 0.001; {P < 0.0001. The upper 
and lower margins of each box define the 75th and 25th centiles, respectively; 
the internal line defines the median, and the whiskers define the range. Values 
outside the interquartile range are shown as open circles. 


through a Smad-dependent mechanism”*. Whereas the altered matrix 
environment in SSS probably contributes to excessive TGF-B activity 
early in the course of disease, TGF-B induces its own production and 
activation by pDCs, as well as IL-6 secretion (known prerequisites 
for Ty17 polarization)**. pDCs can also induce either Ty1 or Ty2 
skewing through IL-6/IFN-a-dependent or OX40 ligand (OX40L)/ 
IL-4-dependent mechanisms, respectively (Supplementary Fig. 15)’. 
pDCs in a Ty2 environment become activated and show enhanced 
IL-4 secretion, constituting a potential feedforward mechanism for 
maintenance of a T};2 response™. In the context of high TGF-f sig- 
nalling, this might also allow for T};9 skewing”. Cytokines related to 
Ty2, Ty17 and pDCs, including IL-4, IL-6, IL-13, IL-17 and IFN-«, 
have been prominently implicated in the fibrotic response in various 
disease states, including SSc'**?"*"*°-”_ This study implicates TGF-B 
in pDC recruitment. 

Although many studies have highlighted the contribution of integ- 
rins to fibrotic disease®, their focus has been on the ability of certain 
integrins to release (activate) TGF-B1 or TGF-B3 from the LLC 
through a direct interaction with RGD sequences in LAP1 and LAP3 
(ref. 8). Multiple observations in this study suggest that enhanced 
bioavailability of TGF-f, rather than its activation, may be the primary 
determinant of increased TGF-B activity in SSS and perhaps SSc. Our 
in vitro data in SSc fibroblasts suggest that cell-surface integrins can 
influence the inherent signalling properties of the TGF-B receptor 
complex in response to free and active TGF-f. Although the initiating 
pathogenic event in SSc remains unknown, this study provides evid- 
ence for a cell autonomous signalling defect that is maintained in 
culture. In theory, this could relate to primary but poorly penetrant 
genetic alterations or fixed epigenetic modifications, both of which 
may require a large environmental trigger. 
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Activation of ERK1/2 has previously been implicated in the TGF-f- 
mediated fibrotic response in general, and specifically in SSc fibroblasts****. 
It was previously observed”* that constitutive ERK1/2 signalling in SSc 
fibroblasts drives the expression of integrin #,[3. Both «,B3; and TGF-B 
were required for excessive collagen production. Despite overlapping 
observations and the common conclusion that «83 represents an 
attractive therapeutic target, this study places ERK1/2 activation 
downstream of both TGF-B and enhanced active «$3 expression in 
SSc fibroblasts and shows phenotypic rescue in response to ERK ant- 
agonism in an in vivo model of scleroderma. Furthermore, we show 
prominent ERK1/2 signalling in pDCs in SSS mice, a described pre- 
requisite for the stabilization, nuclear export and translation of IFN-« 
messenger RNA”’, and for Toll-like receptor-mediated expression of 
inflammatory cytokines*’. Whereas previous work associated low levels 
of miR-29, a negative regulator of collagen expression, with fibrotic 
diseases including post-injury cardiac fibrosis'® and SSc"’, this study offers 
a pathogenic sequence for scleroderma that integrates structural 
matrix elements, integrins, TGF-B signalling, ERK activation and 
miR-29. 

SSS mouse models demonstrate the potential to reverse established 
dermal fibrosis, and they suggest several therapeutic strategies includ- 
ing §, integrin activation and blockade of signalling by B; integrin, 
TGF-B or ERK. When paired with the ability to perform preclinical 
trials in the first described mouse models of a genetically defined 
human presentation of scleroderma, the potential for therapeutic 
advancement seems promising. 


METHODS SUMMARY 

Subjects. Patients were recruited from the Scleroderma Center and Connective 
Tissue Clinic at Johns Hopkins Hospital. All skin biopsies and protocols were 
performed in compliance with the Johns Hopkins School of Medicine Institutional 
Review Board after informed consent. 

Mice. All mice were cared for in strict compliance with the Animal Care and Use 
Committee of the Johns Hopkins University School of Medicine. Fbn1?!*45"/* 
and Fbn1'5C'* mice were generated by homologous recombination. Itgb3*/~ 
mice were purchased through Jackson Laboratories as heterozygotes. Skin biopsy, 
sera collection, enzyme-linked immunosorbent assay (ELISA), as well as histolo- 
gical and flow cytometric analysis and in vivo trials were performed as described in 
Methods. 

Cell culture. Primary human dermal fibroblasts were derived from skin biopsies 
from five patients with active diffuse systemic sclerosis and from six healthy 
controls. All antibodies and in vitro drug treatments and dosing, as well as meth- 
ods for flow cytometric analyses, western blotting, RNA isolation and quantitative 
PCR are described in Methods. All details regarding co-culture of pDCs and 
murine embryonic fibroblasts are described in Methods. 

Statistics and graphs. All quantitative data are shown as standard boxplots pro- 
duced in R statistical software. Statistical analysis was performed with a two-tailed 
t-test, assuming equal variance between the compared groups. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Participants. Patients were recruited from the Scleroderma Center and Connective 
Tissue Clinic at Johns Hopkins Hospital (F.M.W. and H.C.D.). All skin biopsies 
and research protocols were performed in compliance with the Johns Hopkins 
School of Medicine Institutional Review Board and after informed consent. 
Mice. All mice were cared for in strict compliance with the Animal Care and Use 
Committee of the Johns Hopkins University School of Medicine. Fbn1?°°"/* 
and Fbn1'°”?“'* mice were generated by homologous recombination as described 
in the next section. Itgb3 */~ mice were purchased through Jackson Laboratories as 
heterozygotes. All experimental mice were on a mixed C57BL/6J and 129/SvEv 
background. To minimize potentially confounding background effects, all com- 
parisons between genotypes and between treatment arms within a genotype were 
made between sex-matched littermates; no other randomization procedures were 
applied. Sample size estimates for the assessment of therapeutic responses were 
based on the penetrance and severity of the specific parameter under consideration 
in untreated cohorts. 

Generation of Fbn1?!*S"* and Fbn1°?(* mice. Fbn1?!°S"/* and Fon 9?0* 
mice were generated by homologous recombination (Supplementary Fig. 1a). A 
10-kilobase Fbn1 fragment was generated by PCR from mouse genomic tail DNA, 
digested with Acc65 and Nhel restriction enzymes (NEB), and ligated into pSL301 
(Invitrogen Corp.). Site-directed mutagenesis was performed with the QuikChange 
mutagenesis kit (Stratagene Inc.), creating either the D1545E or W1572C muta- 
tion. The targeting vector was assessed by sequence analysis. Site-directed muta- 
genesis was again performed to remove the AaflI restriction site from pSL301. The 
NeoR cassette was amplified from pEGFP-C1 (Invitrogen Corp.) and the amplicon 
was subcloned into pCR2.1-TOPO (Invitrogen Corp.). A fragment containing the 
AatiI restriction site and NeoR, with flanking loxP sequences, was subcloned into a 
unique AailI site in the Fbn1 intron before exon 38. The sequences of the loxP 
sites and site-directed mutagenesis-created mutations were confirmed by direct 
sequencing. The vector was linearized using a unique (NruI) site and electropo- 
rated into R1 embryonic stem cells. Positive clones were identified by Southern blot 
analysis (Supplementary Fig. 1b) as described previously’’. Positive clones were 
injected into 129/SvEv blastocysts at embryonic day 3.5 and transferred into pseu- 
dopregnant females. Chimaeric offspring were mated to C57BL/6J mice, and germ- 
line transmission was observed for at least three independent targeting events for 
each genotype. All exons encompassed by and immediately flanking the targeting 
vector were analysed by sequencing of PCR-amplified genomic DNA derived from 
mutant animals to demonstrate the fidelity of targeting. Complete concordance of 
phenotype for three or two independent lines for mutations W1572C or D1545E, 
respectively, excluded any major off-target effect. Mice were genotyped on the 
basis of creation of a new Acil site (W1572C) or destruction of a BsmAI site 
(D1545E) in correctly targeted mice (Supplementary Fig. 1c). Primers used for 
amplification were 5’-GATCCCACCACCTGCATC-3’ (sense) and 5'-CATGTG 
TTCACAGAAGGACAC-3’ (antisense). The loxP-flanked NeoR was removed by 
breeding Fbn1P?"/* and Fbn1™'5”?“'* mice with transgenic mice that ubiqui- 
tously expressed Cre recombinase using a Ella promoter, purchased through 
Jackson Laboratories. More than 85 embryos were genotyped at embryonic day 
10.5 for Fbn1?°"* homozygosity. 

In vivo drug treatment. All antibodies used to treat mice or cells were azide-free. 
Male mice were treated with B, integrin-activating antibody (B,aAb, Rat Clone 
9EG7; special-ordered more than 98% pure and azide-free from BD Biosciences) 
or an isotype-matched control (Rat IgG2a, k; special-ordered more than 98% pure 
and azide-free from BD Biosciences) by intraperitoneal injection at 2mgkg ' 
every 5 days for 12 weeks, beginning at 1 month of age. Complete blood cell counts 
were performed to exclude pancytopenia in B,aAb-treated animals (Supplemen- 
tary Fig. 16a). For the TGF-B-neutralizing trial, 3-month-old male mice were 
treated with pan-specific TGF-[-neutralizing antibody (Mouse Clone 1D11, cata- 
logue no. MAB1835; R&D) or an isotype control (Mouse IgG1, clone 11711, 
catalogue no. MAB002; R&D) by intraperitoneal injection at 10mgkg ' on 
alternate days for 12 weeks. RDEA119 was provided by C. J. Thomas, S. Patnaik 
and J. J. Marugan (National Institutes of Health Chemical Genomics Center). 
RDEA119 was reconstituted in 10% 2-hydroxypropyl-fB-cyclodextrin (Sigma- 
Aldrich) dissolved in PBS, and was administered twice daily by oral gavage at a 
dose of 25 mgkg  *. Treatment was initiated at 1 month of age and continued for 
8 weeks. 2-Hydroxypropyl-f-cyclodextrin (10% in PBS) was administered as a 
control. Given the absolute concordance regarding pathology and therapeutic 
responses for Fbn1?'*8"!* and Fbn1“°”“'* mice seen early in this study, later 
studies focused on Fbn1 mice to limit the expense associated with in vivo 
antibody (TGF-BNAb and B,aAb) and drug (RDEA119) trials. 

Stiffness scoring. A clinical stiffness score was assigned by five observers blinded 
to genotype and treatment status. Mice were assessed in random order. A score of 1 
indicates no stiffness (that is, identical to wild-type mice). A score of 4 indicates 
extreme stiffness based on previous experience with untreated SSS mice; 2 and 3 
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indicate a subjective assessment of an intermediate level of stiffness. Early in the 
course of studies, the same mice were assessed by the same observer on a different 
day. This pilot demonstrated excellent intra-observer concordance. To measure 
stretched skin area and total surface area, mice were anaesthetized with isoflurane, 
and the back skin was shaved and treated briefly with Nair cream. Area measure- 
ments were performed with NIH image J software (National Institutes of Health). 
Mice were then suspended briefly with forceps secured to the back skin by a clamp 
and photographed in profile in a uniform manner (Supplementary Fig. 4a,b). 
There were no differences in body weight between all experimental groups 
(Supplementary Fig. 4c). 

Histology. For tissue analysis, animals were killed with inhalational halothane 
(Sigma) or anaesthetized with isoflurane. Back skin was shaved and treated briefly 
with Nair cream before biopsy. Fixed skin was paraffin-embedded, sectioned, and 
stained with a standard Masson’s trichrome stain. Dermal and subcutaneous fat 
thicknesses were measured under high-power fields as described previously”. 
Immunofluorescent staining was performed on frozen sections as described prev- 
iously*’. Active o,f 3 was detected with the WOW-1 antibody (gift from S. Shattil) 
and an anti-mouse Alexa Fluor-594 F(ab’), fragment secondary antibody (cata- 
logue no. A11020, Invitrogen). Other antibodies used included anti-CD45 antibody 
(catalogue no. 550539, BD), anti-Siglec H (catalogue no. 14-0333-81, ebiosciences) 
and antibodies against LAP1 (catalogue no. 141402, BioLegend), LAP2 (catalogue 
no. LS-C137100, Lifespan BioSciences), active TGF-B1 (Clone LC(1-30), gift from 
K. Flanders), and total TGF-f2 (catalogue no. ab66045, abcam). With the excep- 
tion of WOW-1, all other antibodies were conjugated by means of an amine-based 
Alexa Fluor antibody labelling kit (catalogue nos A-20181, A20187, A-20185 and 
A-20186, Invitrogen). 

Electron microscopy. Electron microscopy was performed as described previously”. 
Enzyme-linked immunosorbent assay. Mouse sera were collected and enzyme- 
linked immunosorbent assays (ELISAs) were performed with the Mouse Anti- 
Nuclear Antigens and Mouse Anti-Scl70kits (catalogue nos 5210 and 6110, 
AlphaDiagnostic) in accordance with the manufacturer’s instructions. 

Cell culture. Primary human dermal fibroblasts were derived from skin biopsies 
from five patients with active diffuse systemic sclerosis and from six healthy con- 
trols. Biopsies were taken from the forearm and cultured as described previously’. 
All experiments were performed in cell lines at low (less than 5) passage. Primary 
MEFs were derived from embryonic day 13.5 embryos as described previously”. 
Murine pDCs were isolated from the spleens of wild-type C57B16/J mice by using 
the Plasmacytoid Dendritic Cell Isolation Kit II (catalogue no. 130-092-786, 
Miltenyi Biotec) and a midiMACS Separator (catalogue no. 130-042-302, Miltenyi 
Biotec) in accordance with the manufacturer’s instructions. The pDC-containing 
cell suspensions routinely had more than 95% purity, as detected by flow cytometry. 
For MEF/pDC co-culture experiments, MEFs were cultured to complete confluence 
in culture medium containing RPMI 1640, 100 pg ml! streptomycin, 100 U ml! 
penicillin, 2 mM L-glutamine (Gibco) and 10% heat-inactivated fetal calf serum. At 
72h after confluence, 5 X 10* murine splenic pDCs ml’ were plated onto MEF 
monolayers. After 72h of co-culture, both adherent and non-adherent cellular 
fractions were harvested, counted, and analysed by flow cytometry. 

Flow cytometric analysis. Mouse skin was digested for flow cytometric analysis as 
described previously*®. On average, 4 X 10° cells were obtained from a 1 X 2. cm* 
piece of skin for wild-type mice, and 8 X 10° cells were obtained from either SSS 
mouse model. Murine Fc receptors were blocked using antibodies against mouse 
CD16/32 antigens (catalogue no. 553141, BD Biosciences). Murine plasmacytoid 
dendritic cells were isolated as reported previously’. All isolated cells (including 
murine dermal cells, cultured MEFs, splenic murine pDCs or human dermal 
fibroblasts) were stained and fixed using the BD Cytofix/Cytoperm system (cata- 
logue no. 554722, BD Biosciences). Data were acquired using CellQuest-Pro soft- 
ware on a FACSCalibur flow cytometer or BD FACSuite software on a FACSVerse 
flow cytometer (BD Biosciences). Data were analysed and all flow cytometry plots 
were contour plots (with outliers) that were generated with FlowJo software 
(TreeStar). For histograms, FlowJo software divides all events into 256 ‘bins’, 
which are numerical ranges for the parameter on the x axis. The percentage of 
maximum (y axis) is the number of cells in each bin divided by the number of cells 
in the bin that contains the largest number of cells. Gating for live cells was based 
on staining with the LIVE/DEAD Fixable Dead Cell Stain Kit (catalogue no. 
134955, Invitrogen). All staining was performed with fluorophore-conjugated 
primary and isotype control antibodies. All antibodies were either purchased as 
fluorochrome conjugates or conjugated by means of amine-based Alexa Fluor 
antibody labelling kits (catalogue nos A-20181, A20187, A-20185 and A-20186, 
Invitrogen). Mouse and human active 0.3 was detected with fluorophore-con- 
jugated WOW-1 antibody (gift from S. Shattil). EDTA (10 mM) and 2 mM MnCl, 
were used as negative and positive controls, respectively, for %f3 activation in flow 
cytometry experiments (Supplementary Fig. 16b)**’. Integrin «,B;, a subtype 
known to react with the WOW-1 antibody”’, was monitored in mouse and human 
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cells with a specific antibody (catalogue no. LS-C36943, Lifespan Biosciences). 
Other antibodies used on mouse cells were: integrin 8; (Clone eBioHMb1-1, 
catalogue no. 17-0291-80, eBiosciences), integrin 83; (Clone 2C9.G3, catalogue 
no. 12-0611, eBiosciences), integrin «; (catalogue no. 11-0493-83, eBiosciences), 
integrin B, (catalogue no. LS-C152915, Lifespan BioSciences), integrin Bg (Clone 
H-160, catalogue no. sc-25714, Santa Cruz Biotechnology) and pERK1/2 (cata- 
logue no. 4370, Cell Signaling). Antibodies used for the immunological character- 
ization of mouse cells included (from ebiosciences) IL-13 (catalogue no. 53-7133-82) 
and IL-22 (catalogue no. 12-7221-82); (from BD Biosciences) Ly6C (catalogue 
no. 560593), CD11b (catalogue no. 562127), CD4 (catalogue no. 560783), CD8 
(catalogue no. 560469), CD19 (catalogue no. 550992), CD138 (catalogue no. 
553714), IL-9 (catalogue no. 561492), IL-17 (catalogue no. 560522), IL-4 (cata- 
logue no. 557739), IL-6 (catalogue no. 561376), IFN-y (catalogue no. 560660), 
Foxp3 (catalogue no. 560047) and B220 (catalogue no. 561226); and (from 
Biolegend) CD3 (catalogue no. 100227) and Siglec H (catalogue no. 129611). 
The antibody against IFN-« was from PBL interferon source (catalogue no. 
22100-3). The antibody against CD317 was from eBiosciences (catalogue no. 
46-3172-82). Antibodies used with human fibroblasts were integrin B, (Clone 
MAR4, catalogue no. 557332, BD Biosciences) and integrin 8; (Clone VI-PL2, 
catalogue no. 17-0619-42, eBiosciences). 

Stimulation of human dermal fibroblasts with TGF-B1 in vitro. All cells were 
counted at splitting and all treatments were performed at 70% confluence. Cells were 
serum-starved for 48h before stimulation with 2ngml ‘ recombinant TGF-B1 
(catalogue no. 240-B-010, R&D). When TGF-B1 or vehicle was added, cell culture 
dishes were immediately rocked on the same rocker three times in air/CO, (19:1) at 
37 °C, to control for mechanical activation of mitogen-activated protein kinase. 
Before lysate harvest, cells were washed with prewarmed (42 °C) 1 X PBS (Gibco). 
All antibody treatments of human fibroblasts were added during starvation for 48 h 
before stimulation with TGF-B1, whereas inhibitors SD208 (1 4M) and UO126 
(10 LM) (catalogue nos 616456 and 662005, EMD Millipore) were added 6 h before 
stimulation. Antibodies used in vitro were mouse IgG1 (0.2mgml ’, Clone 
P3.6.2.8.1, catalogue no. 16-4714-81, eBiosciences), IgG2a (0.2 mgm, Clone eBM2a, 
catalogue no. 16-4724, eBiosciences), %,[3-blocking (30 1g ml ', Clone LM609, 
catalogue no. MAB1976Z, Millipore), B,-activating (7 1g ml~ ! Clone TS2/16, cata- 
logue no. 14-0299, eBiosciences) and f,-blocking (0.2 mg ml !, Clone P4C10, 
catalogue no. MAB1987Z, Millipore) antibodies. 

Western blotting. Before lysate harvest, cells were washed with prewarmed 
(42°C) 1X PBS (Gibco). Total protein was isolated from cells with ice-cold 
RIPA buffer (25 mM Tris/HCl pH 7.6, 150 mM NaCl, 1% Nonidet P40, 1% sodium 
deoxycholate, 0.1% SDS) with phosphatase and protease inhibitors (catalogue nos 


04906837001 and 11836170001, Roche). Western blotting was performed using 
the Bio-Rad and LiCor Odyssey detection systems as described previously’. The 
relative intensities were measured using LiCor Odyssey software. The following 
antibodies were used: phosphorylated and total ERK (Clone D13.14.4E, catalogue 
no. 4370, and Clone 3A7, catalogue no. 9107, Cell Signaling), vinculin (Clone 
hVIN-1, catalogue no. V9131, Sigma), and phosphorylated and total Smad3 (cata- 
logue nos 1880-1 and 1735-1, Epitomics). 

RNA isolation and qPCR. Total RNA was isolated from cultured cells or tissue by 
using Trizol (Invitrogen) in accordance with the manufacturer’s protocol. Quanti- 
tative PCR for miR-29a and 18S ribosomal RNA was performed with pre-designed 
Taqman primers and probes (ABI) in accordance with the manufacturer’s instruc- 
tions. Relative quantification for each transcript was obtained by normalizing 
against 18S transcript abundance according to the formula 2°“/2-'89), 
Statistics and graphs. All quantitative data are shown as standard boxplots pro- 
duced in R statistical software. The upper and lower margins of each box define the 
75th and 25th centiles, respectively; the internal line defines the median, and the 
whiskers define the range. Statistical analysis was performed with a two-tailed 
t-test, assuming equal variance between the compared groups (asterisk, P < 0.05; 
two asterisks, P<0.01; dagger, P< 0.001; double dagger, P< 0.0001). Values 
outside the interquartile range are shown as open circles (R software default), 
but were not excluded from or treated differently in statistical analyses. 
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A directional switch of integrin signalling and a new 


anti-thrombotic strategy 


Bo Shen!, Xiaojuan Zhao, Kelly A. O’Brien’, Aleksandra Stojanovic-Terpo', M. Keegan Delaney’, Kyungho Kim', Jaehyung Cho’, 


Stephen C.-T. Lam! & Xiaoping Du! 


Integrins have a critical role in thrombosis and haemostasis’. Anta- 
gonists of the platelet integrin a);,B3 are potent anti-thrombotic 
drugs, but also have the life-threatening adverse effect of causing 
bleeding’. It is therefore desirable to develop new antagonists that 
do not cause bleeding. Integrins transmit signals bidirectionally*”. 
Inside-out signalling activates integrins through a talin-dependent 
mechanism”. Integrin ligation mediates thrombus formation and 
outside-in signalling*’, which requires Ga,3 and greatly expands 
thrombi. Here we show that Ga; and talin bind to mutually exclu- 
sive but distinct sites within the integrin B; cytoplasmic domain in 
opposing waves. The first talin-binding wave mediates inside-out 
signalling and also ligand-induced integrin activation, but is not 
required for outside-in signalling. Integrin ligation induces tran- 
sient talin dissociation and Ga; binding to an EXE motif (in which 
X denotes any residue), which selectively mediates outside-in sig- 
nalling and platelet spreading. The second talin-binding wave is 
associated with clot retraction. An EXE-motif-based inhibitor of 
Ga.,3-integrin interaction selectively abolishes outside-in signalling 
without affecting integrin ligation, and suppresses occlusive arterial 
thrombosis without affecting bleeding time. Thus, we have discovered 
a new mechanism for the directional switch of integrin signalling 
and, on the basis of this mechanism, designed a potent new anti- 
thrombotic drug that does not cause bleeding. 
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Integrin signalling involves the binding of several molecules to the 
cytoplasmic domain of integrin B-subunits including talin®”, kindlins'*", 
SRC and Go,3° (Fig. 1a). Co-immunoprecipitation of Go,3 with 
various 3 carboxy-terminal truncation mutants suggests that Go. bind- 
ing involves the B sequence between Lys 729 and Thr 741 (Fig. 1b and 
Extended Data Fig. 2a), but not the kindlin- or SRC-binding sequences 
(Fig. 1a, b). Alignment of different B cytoplasmic domains reveals an 
EXE motif in this region, in which the first and third Glu residues 
are conserved among most B subunits, but not Bg (Fig. la). The EXE- 
motif-containing B,, B, and B; all bound Ga,3, but not Bg (Fig. 1c and 
Extended Data Fig. 2f). Wild-type and E732A mutant B3 bound to 
Go3, but the E731A, E733A, AAA (E731-733A) (Fig. 1d and Extended 
Data Fig. 2b), DED (E731D/E733D) and QSE (E731Q/E732S) (Extended 
Data Fig. 2e) mutants did not, indicating that the first and third Glu 
within the EXE motif are important for Ga; binding. Synthetic pep- 
tides containing the EEERA sequence inhibited Ga,;-f interaction 
(see below), verifying this EXE-motif-containing Ga, ,-binding site. 

The EXE motif is located in a talin-binding region (Fig. 1a)'*». 
Overexpression of the integrin-binding talin head domain (THD) in 
Oy»B3-expressing cells inhibited Ga; co-immunoprecipitation with 
B3 (Fig. le). Purified recombinant THD and Ga; competed directly 
for binding to purified glutathione S-transferase (GST)-B3 cytoplas- 
mic domain fusion protein (GST-B3CD) (Fig. 1f, g and Extended Data 


Figure 1 | Mutually exclusive binding of talin 
and Ga,3 to B3. a, The sequence of the human B; 
cytoplasmic domain and its alignment with other B 
subunits, showing conserved EXE motifs and 
binding sites for talin, kindlins and SRC. b, Co- 
immunoprecipitation of wild-type (WT) and 
truncated mutant B; with Ga,; and talin using anti- 
B3 or control pre-immune (pre-im) rabbit serum. 
Immunoprecipitates (IP) and CHO cell lysates 
(10% of that used in immunoprecipitation) were 
immunoblotted (IB) with indicated antibodies. 

c, Binding of purified recombinant Ga; to 
glutathione-bead-bound GST, and cytoplasmic 
domain fusion proteins GST-B,;CD, GST-B,CD, 
GST-B3CD or GST-BgCD. d, Co- 
immunoprecipitation of CHO-cell-expressed wild- 
type or EXE-motif-mutated 8; with Go; and talin 
using anti-B; or pre-immune rabbit serum. e, Co- 
immunoprecipitation of CHO-cell-expressed 
integrin 07,83 with Go; and THD after 
transfection with cDNA encoding THD. 

f, g, Inhibition of the binding of THD (20 nM) 

(f) or Gar;3 (40 nM) (g) to immobilized GST-B3;CD 
proteins (wild-type and negative control mutants) 
by increasing concentrations of Go,3 (f) or THD 
(g). Bound Ga,3 or THD was detected using anti- 
Go; or anti-talin. Error bars represent mean = s.d. 
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Figure 2 | Dynamics of talin and Ga); binding to B3 and the role of talin in 
integrin signalling. a—c, Human platelets were stimulated with 0.025 Uml * 
a-thrombin (in an aggregometer) with or without 2 mM integrin inhibitor 
RGDS, solubilized at various time points, immunoprecipitated with anti-B; or 
pre-immune rabbit serum, and immunoblotted for Go.,3, talin and B; 
(additional controls in Extended Data Fig. 3d). a, Typical immunoblots. 

b, Quantification of immunoblots (mean = s.d., three experiments). OD, 
optical density. c, Turbidity changes indicating integrin-dependent platelet 
aggregation. d, Immunoblotting of talin 1 in wild-type and Tin '~ (talin 

1 ‘-) mouse platelets. e, Aggregation of wild-type and talin 1”/~ platelets 
stimulated with 5 1M ADP in the presence of 20 pg ml ' fibrinogen, with or 
without 1 mM MnCl, or 0.3 1g ml | LIBS6. f, Adhesion of unstimulated mouse 
platelets to immobilized fibrinogen for 1h, with or without 1 mM MnCl or 
0.18 tg ml~' LIBS6 (quantified as percentage of loaded platelets, mean + s.d., 
n= 4, *P < 0.001). g, Images of phalloidin-stained mouse platelets spreading 
on fibrinogen for 1h, with or without 1 mM MnCl, or 0.18 ugml~' LIBS6 
(quantification in Extended Data Fig. 4e). 


Fig. 2c, d), indicating that Go,3 and talin are mutually exclusive in 
binding to 3. Interestingly, the binding of talin and Ga; is regulated 
temporally during integrin signalling (Fig. 2). The first wave of talin 
association with o7,$3 occurred after thrombin-stimulated inside-out 
signalling (Fig. 2a, b) and before the onset of integrin ligation (as indi- 
cated by platelet aggregation (Fig. 2c)). However, after integrin ligation, 
talin association with oy,83 was diminished (Fig. 2a, b). The second 
wave of talin-B, association occurred after full platelet aggregation 
(Fig. 2a—c), the timing of which correlates with clot retraction. Opposite 
to the waves of talin binding, the Go.,3-B3 association was even lower 
than the basal level during inside-out signalling when the first talin- 
binding wave occurred (Fig. 2a, b), but peaked after integrin ligation 
when the first talin-binding wave subsided, and then decreased again 
during the second talin-binding wave (Fig. 2a, b). Thus, inside-out and 
various phases of outside-in signalling are associated with coordinated 
and opposing waves of Go; and talin binding to B3. 

Importantly, an increase in Go 3 binding to integrin can only be 
induced when integrin is activated in the presence of fibrinogen, but 
not by integrin activation alone (Extended Data Fig. 3a). Conversely, 
the integrin inhibitors RGDS (Arg-Gly-Asp-Ser, Fig. 2a, b) or EDTA 
(ethylenediaminetetraacetic acid, Extended Data Fig. 3b, c) prevented 
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dissociation of talin from B3 and inhibited Ga,3-B3 interaction in 
thrombin-stimulated platelets. Thus, the switch from a talin-bound 
to a Ga3-bound state of o,83 is initiated by the binding of macro- 
molecular ligands. 

The opposing waves of talin and Go; binding to B; suggest that the 
interaction of these two proteins with B; selectively mediates inside- 
out and outside-in signalling, respectively. This hypothesis was tested 
using talin knockout'® and shRNA-induced talin knockdown platelets, 
which are defective in adenosine diphosphate (ADP)/fibrinogen-induced, 
integrin-dependent aggregation (Fig. 2d, e and Extended Data Fig. 4a, c). 
Their defective aggregation was fully corrected with manganese or an 
integrin-activating antibody (LIBS6) (Fig. 2e and Extended Data Fig. 4c), 
which activate integrins independently of inside-out signalling. These 
data confirm a role for talin in inside-out signalling®'*'”. It is estab- 
lished that inside-out signalling is not the only pathway of oy, acti- 
vation. Integrin-fibrinogen interaction may occur independently of 
inside-out signalling when fibrinogen changes conformation, either 
by immobilization or conversion to fibrin'’*”. This is because the initial 
contact of the exposed ligand recognition sequence, RGD, with resting 
integrins triggers ligand-induced integrin activation”. Interestingly, 
adhesion of resting talin-knockout or -knockdown platelets to immo- 
bilized fibrinogen was defective (Fig. 2f and Extended Data Fig. 4b), 
indicating the importance of talin in platelet adhesion to immobilized 
fibrinogen in the absence of inside-out signalling. However, addition of 
manganese or integrin-activating antibody fully corrected talin-knockout 
and -knockdown platelet adhesion and spreading (and also the spreading 
of talin-binding-defective mutant B3-expressing Chinese hamster ovary 
(CHO) cells”) on immobilized fibrinogen (Fig. 2f, g and Extended Data 
Fig. 4b, d, e). Thus, the role of talin in resting platelet adhesion to 
fibrinogen is solely due to its importance in ligand-induced integrin 
activation. Because cell spreading requires the early phase of outside-in 
signalling, these data further demonstrate that talin is not required for 
the early phase of outside-in signalling leading to cell spreading once its 
role in integrin activation is bypassed. 

To assess whether Ga; binding to the EXE motif selectively medi- 
ates outside-in signalling without perturbing talin-dependent integrin 
function, wild-type and AAA mutant B3-transfected ITGB3 (Bs) /~ 
bone marrow stem cells (from B; /— mice) were transplanted into 
irradiated B; ‘~ mice. The platelets from the recipient mice expressed 
similar levels of wild-type or AAA mutant B; (Fig. 3a and Extended 
Data Fig. 5a). The AAA mutation inhibited 83 interaction with Gos, 
but not talin (or SRC) (Fig. 3b and Extended Data Fig. 5b), during 
integrin signalling. The AAA mutation also had no effect on agonist- 
induced soluble fibrinogen binding (Fig. 3c). Thus, the EXE motif is 
not required for talin-dependent inside-out signalling. By contrast, the 
AAA mutant f3-expressing platelets were defective in spreading on 
immobilized fibrinogen (Fig. 3d and Extended Data Fig. 5c, d). Thus, 
Go, 3-binding deficiency in B3 causes a selective defect in integrin 
outside-in signalling and platelet spreading. Similarly, AAA and more 
conserved DED or QSE 3 mutants expressed in CHO cells, all defect- 
ive in Ga, 3 binding (Extended Data Fig. 2e), and were also defective in 
spreading on fibrinogen (Fig. 3e, f and Extended Data Fig. 6a-c). 
However, AAA mutant 23 expressed in CHO cells had no negative 
effect on THD binding, in contrast to the Y747A mutant (Extended 
Data Fig. 6d, e). In addition, AAA-expressing cells showed defects in 
integrin-dependent activation of SRC (as shown by phosphorylation at 
Tyr 416) and transient inhibition of RHOA during cell spreading 
(Fig. 3g and Extended Data Fig. 6f), both of which are important 
elements of outside-in signalling. Together with previous studies that 
identified B, sequences mediating talin binding (Fig. la)*'*'””’, our 
data suggest that talin and Go 3 dynamically interact with distinct 
recognition sequences in the same region of {3 to serve as a molecular 
switch controlling the direction of integrin signalling. 

The specific role of the EXE motif in outside-in signalling prompted 
us to design selective inhibitors of outside-in signalling. We synthesized 
several myristoylated (Myr) EXE-motif-containing B3; peptides: mPs; 
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c Figure 3 | The selective role of Ga; EXE binding 


IP:pre-imm anti-B, in integrin outside-in signalling. a, Flow 
Time (min) 0 ae 2 REED 30.0 1 2 5 10 30 cytometric analysis of 8; expression in platelets 
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AAA mutant f3-transfected bone marrow stem 
cells. 8; ‘~ platelets served as negative control. 
Picture shows relative fluorescence (FL1) on the 
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(Myr-EEERA), mP, (Myr-FEEERA) and mP; (Myr-KFEEERARAKWDT). 
These peptides inhibited co-immunoprecipitation between Ga; and 
B3 (Fig. 4a and Extended Data Fig. 7a-d), indicating that the minimal 
sequence of EEERA is sufficient to bind Ga, 3. By contrast, only mP);, 
but not mP, (or mPs), inhibited talin association with B3 (Fig. 4a), 
indicating that mP, does not interact with talin. mP, inhibited platelet 
spreading on fibrinogen (Fig. 4b and Extended Data Fig. 7e), but had 
no effect on either agonist-induced fibrinogen/PAC1 (an ligand-mimetic 
antibody recognizing activated o,,f3) binding to platelets (Fig. 4c and 
Extended Data Fig. 7f, g) or platelet adhesion to immobilized fibrino- 
gen (Fig. 4d). Interestingly, mP, did not inhibit, but rather accelerated, 
platelet-dependent clot retraction (Fig. 4e). These data indicate that the 
EXE-based inhibitor mP¢ selectively inhibits the early phase of outside- 
in signalling without affecting talin-dependent inside-out signalling, 
ligand-induced integrin activation, or the late phase of outside-in sig- 
nalling associated with the second wave of talin binding. By contrast, 
mP);3 inhibited inside-out and outside-in signalling, as it inhibited 
fibrinogen binding (Extended Data Fig. 7h), platelet adhesion (Fig. 4d) 
and clot retraction (Fig. 4e) (not reversed by manganese, as previously 
shown using talin”‘~ platelets'®). Thus, mP, selectively interferes with 
the early phase of outside-in signalling, but mP; affects all phases of 
integrin signalling. Importantly, mP. inhibited the second wave of 
thrombin-induced platelet aggregation in vitro (Fig. 4f), and when 
injected into mice as micelles, was as potent as the currently used 
integrin antagonist Integrilin in inhibiting laser-induced arteriolar 
thrombosis (Fig. 4g, h, Extended Data Fig. 8a, b and Supplementary 
Videos 1-4) and FeCl,-induced occlusive carotid artery thrombosis 
(Fig. 4i and Extended Data Fig. 8c). Notably, at the concentration at 
which both Integrilin and mPg similarly inhibited occlusive throm- 
bosis, Integrilin considerably prolonged tail bleeding and increased 
blood loss, whereas mP¢ had no such adverse effect (Fig. 4) and Extended 
Data Fig. 8d). Thus, we have discovered a novel anti-thrombotic that 
prevents thrombosis without causing bleeding. 

Together, our study provides a conceptual advance by revealing a 
molecular switch controlling the directions and consequences of integrin 
signalling. We show that the switch between inside-out and outside-in 
signalling is mediated by coordinated but opposing waves of talin and 
G3 binding to distinct yet adjacent sequences within the B; cytoplas- 
mic domain. The discovery of this signalling switch forms a conceptual 
basis for selectively inhibiting outside-in signalling without perturbing 
the ligand-binding function of integrins. Importantly, we translated 
this new concept into a potent novel anti-thrombotic, which, unlike 
currently available integrin antagonists or other anti-thrombotics, 
potently inhibits arterial thrombosis without the adverse effect of caus- 
ing bleeding (Fig. 4g-j), a potentially life-threatening problem that 
limits the clinical use of current anti-integrin and anti-thrombotic 
therapies. 


METHODS SUMMARY 

Co-immunoprecipitation. Co-immunoprecipitation was performed as described 
previously* using platelets or %,B3-expressing CHO-1b9 cells”. 

Myristoylated peptide inhibitors. Myristoylated peptide inhibitors were synthe- 
sized and purified at the Research Resource Center at University of Illinois at Chicago. 
These include mPs, mP¢ and mP}3, as well as their respective scrambled controls: 
mP,3Scr (Myr-EEARERKDWAKFT), mP;Scr (Myr-EEARE) and mP¢Scr (Myr- 
ERAFEE). The peptides were prepared in DMSO for in vitro and in micellar 
formulation for in vivo (and in vitro) uses. The micellar formulation has a molar 
ratio of PEG 999-DSPE, L-a-phosphatidylcholine, and peptides of 45:5:2, and was 
prepared as described previously™*. mP, is similar to mP,Scr in uptake by platelets 
(Extended Data Fig. 9a) and does not cause significant changes in haemogram 
in vivo (Extended Data Fig. 9b). 

Bone marrow stem cells. Bone marrow stem cells from 6-8-week-old integrin 
B; ‘~ or C57BL/6 mice were infected twice with concentrated lentivirus contain- 
ing shRNA or complementary DNA constructs, then retro-orbitally injected into 
irradiated recipient mice (5Gy for integrin B; ‘~ mice and 9.6Gy for C57BL/6 
mice) 1 day after irradiation®. Platelet functional analyses'*”*, flow cytometry”, 
laser-induced cremaster muscle arterial thrombosis*” and FeCl3-induced carotid 
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arterial thrombosis* were performed as described previously. Data were analysed 
using t-test or one-way analysis of variance. 

Tail bleeding time. Analysis was performed as described previously”. Time to stable 
cessation of the bleeding is defined as no re-bleeding for 60s. Bleeding exceeding 
15 min was immediately stopped. Data were analysed using the Mann-Whitney test. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 


Received 8 November 2012; accepted 28 August 2013. 
Published online 27 October 2013. 


1. Shattil, S. J. & Newman, P. J. Integrins: dynamic scaffolds for adhesion and 
signaling in platelets. Blood 104, 1606-1615 (2004). 

2. Coller, B.S. Anti-GPllb/Illa drugs: current strategies and future directions. Thromb. 
Haemost. 86, 427-443 (2001). 

3. Serebruany, V. L, Malinin, A. |., Eisert, R. M. & Sane, D. C. Risk of bleeding 
complications with antiplatelet agents: meta-analysis of 338,191 patients 
enrolled in 50 randomized controlled trials. Am. J. Hematol. 75, 40-47 (2004). 

4. Hynes, R. O. Integrins: bidirectional, allosteric signaling machines. Cel/ 110, 
673-687 (2002). 

5. Moissoglu, K. & Schwartz, 
Cell 98, 547-555 (2006). 

6. Tadokoro, S. et a/. Talin binding to integrin beta tails: a final common step in 
integrin activation. Science 302, 103-106 (2003). 

7. Ye, F., Kim, C. & Ginsberg, M. H. Molecular mechanism of inside-out integrin 
regulation. J. Thromb. Haemost. 9 (Suppl. 1), 20-25 (2011). 

8. Gong,H. etal. G protein subunit Ga; 3 binds to integrin «4,83 and mediates integrin 
“outside-in” signaling. Science 327, 340-343 (2010). 

9. Shen, B., Delaney, M. K. & Du, X. Inside-out, outside-in, and inside-outside-in: G 
protein signaling in integrin-mediated cell adhesion, spreading, and retraction. 
Curr. Opin. Cell Biol. 24, 600-606 (2012). 

10. Moser, M., Nieswandt, B., Ussar, S., Pozgajova, M. & Fassler, R. Kindlin-3 is essential 

for integrin activation and platelet aggregation. Nature Med. 14, 325-330 (2008). 

11. Ma, Y.Q., Qin, J., Wu, C. & Plow, E. F. Kindlin-2 (Mig-2): a co-activator of B3 integrins. 

J. Cell Biol. 181, 439-446 (2008). 

12. Obergfell, A. et a/. Coordinate interactions of Csk, Src, and Syk kinases with opB3 

initiate integrin signaling to the cytoskeleton. J. Cel! Biol. 157, 265-275 (2002). 

13. Flevaris, P. et a/. A molecular switch that controls cell spreading and retraction. 

J. Cell Biol. 179, 553-565 (2007). 

14. Patil, S. et al. Identification of a talin-binding site in the integrin B3 subunit distinct 

from the NPLY regulatory motif of post-ligand binding functions. The talin 

-terminal head domain interacts with the membrane-proximal region of the B3 

cytoplasmic tail. J. Biol. Chem. 274, 28575-28583 (1999). 

15. Wegener, K. L. et al. Structural basis of integrin activation by talin. Ce// 128, 

171-182 (2007). 

16. Haling,J.R., Monkley, S.J., Critchley, D. R. & Petrich, B. G. Talin-dependent integrin 

activation is required for fibrin clot retraction by platelets. Blood 117, 1719-1722 

(2011). 

17. Petrich, B. G. et a/. Talin is required for integrin-mediated platelet function in 

hemostasis and thrombosis. J. Exp. Med. 204, 3103-3111 (2007). 

18. Coller, B.S. Interaction of normal, thrombasthenic, and Bernard-Soulier platelets 

with immobilized fibrinogen: defective platelet-fibrinogen interaction in 

hrombasthenia. Blood 55, 169-178 (1980). 

19. Ugarova, T. P. et al. Conformational changes in fibrinogen elicited by its interaction 

with platelet membrane glycoprotein GPIlb-llla. J. Biol. Chem. 268, 21080-21087 

(1993). 

20. Du, X. etal. Ligands “activate” integrin «1,83 (platelet GPllb-llla). Ce// 65, 409-416 
(1991). 

21. Arias-Salgado, E. G., Lizano, S., Shattil, S. J. & Ginsberg, M. H. Specification of the 
direction of adhesive signaling by the integrin B cytoplasmic domain. J. Biol. Chem. 
280, 29699-29707 (2005). 

22. Goksoy, E. etal. Structural basis for the autoinhibition of talin in regulating integrin 
activation. Mol. Cell 31, 124-133 (2008). 

23. Xi, X., Bodnar, R. J., Li, Z. Y., Lam, S. C. T. & Du, X. P. Critical roles for the COOH- 
terminal NITY and RGT sequences of the integrin B3 cytoplasmic domain in inside- 
out and outside-in signaling. J. Cel/ Biol. 162, 329-339 (2003). 

24. Krishnadas, A., Rubinstein, |. & Onyuksel, H. Sterically stabilized phospholipid 
mixed micelles: in vitro evaluation as a novel carrier for water-insoluble drugs. 
Pharm. Res. 20, 297-302 (2003). 

25. O’Brien, K.A., Gartner, T. K., Hay, N. & Du, X. ADP-stimulated activation of Akt during 
integrin outside-in signaling promotes platelet spreading by inhibiting glycogen 
synthase kinase-38. Arterioscler. Thromb. Vasc. Biol. 32, 2232-2240 (2012). 

26. Delaney, M. K., Liu, J., Zheng, Y., Berndt, M. C. & Du, X. The role of Racl in 
glycoprotein Ib-IX-mediated signal transduction and integrin activation. 
Arterioscler. Thromb. Vasc. Biol. 32, 2761-2768 (2012). 

27. Cho,J.etal. Protein disulfide isomerase capture during thrombus formation in vivo 
depends on the presence of £3 integrins. Blood 120, 647-655 (2012). 

28. O’Brien, K.A., Stojanovic-Terpo, A., Hay, N. & Du, X. An important role for Akt3 in 
platelet activation and thrombosis. Blood 118, 4215-4223 (2011). 

29. Marjanovic, J. A., Li, Z., Stojanovic, A. & Du, X. Stimulatory roles of nitric-oxide 
synthase 3 and guanylyl cyclase in platelet activation. J. Biol. Chem. 280, 
37430-37438 (2005). 


.A. Integrin signalling in directed cell migration. Biol. 


©2013 Macmillan Publishers Limited. All rights reserved 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank T. Kozasa, B. Kreutz and C. Chow for providing purified 
recombinant Go.:3 protein; and B. Petrich and D. Critchley for providing talin ’~ mice. 
We acknowledge that H. Gong performed experiments for this project. This work is 
supported by grants from National Heart, Lung, and Blood Institute (HLO80264, 
HLO62350 (X.D.) and HL109439 (J.C.)). 


Author Contributions B.S. performed most of the experiments and participated in 
experimental design, data analysis and manuscript writing. X.Z., K.A.0., A.S.-T. and 
M.K.D. each performed parts of the experiments and participated in aspects of data 


LETTER 


analyses and manuscript writing. K.K. and J.C. performed laser-induced thrombosis 
experiments and data analysis; S.C.-T.L. provided talin constructs and purified 
proteins, and participated in discussions and data analyses; X.D. designed and directed 
the research, analysed data and wrote the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare competing financial interests: details 
accompany the full-text HTML version of the paper at www.nature.com/nature. 
Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to X.D. 
(xdu@uic.edu). 


7 NOVEMBER 2013 | VOL 503 | NATURE | 135 


©2013 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 

Animals and reagents. Integrin 8; ‘~ mice were obtained from the Jackson 
Laboratory. Tin1", PF4-Cre mice were provided by B. Petrich and D. Critchley’®. 
Animal usage and protocol were approved by the institutional animal care com- 
mittee of the University of Illinois at Chicago. For all animal experiments, mice 
with similar age, weight and sex ratios (1:1, except for laser-induced thrombosis) 
were used for control and specific treatment. The individual mice chosen for 
specific treatment were decided randomly. Human integrin 8; cDNA was cloned 
into pcDNA3.1 vector following digestion with HindIII and Xho I, or pLenti6-V5/ 
Dest vector following digestion with EcoR I, Mfe I and Xho I. Truncation mutants 
and integrin Glu to Ala mutants were either reported previously~* or generated 
using PCR and cloned into pcDNA3.1 vector by BamHI and Xho I. The upstream 
(UP) and downstream (DN) primer sequences used are: (1) ITGB3-UP: 5’-GCG 
AAGCTTGCCGCCATGGACCGAGCGCGGCCGCGGCCCCGGCCGCTCT-3’; 
(2) ITGB3-728DN: 5'-GCGCTCGAGTCAAGCGAATTCTTTTCGGTCGTGGA 
TGGTGATGAG-3’; (3) ITGB3-715DN: 5'-GCGCTCGAGTCACCAGATGAGC 
AGGGCGGCAAGGCCAATGAGCAG-3’; (4) Itgb3-E731A-UP: 5’- AAGAATT 
CGCTAAATTTGCAGAAGAACGCGCCAGAGCAA-3’; (5): Itgb3-E732A-up: 
5'- AAGAATTCGCTAAATTTGAGGCAGAACGCGCCAGAGCAA-3’; (6): Itgb3- 
E733A-UP: 5'- AAGAATTCGCTAAATTTGAGGAAGCACGCGCCAGAGCAA-3’; 
(7): Itgb3-E731-733A-UP: 5'- AAGAATTCGCTAAATTTGCAGCAGCACGCG 
CCAGAGCAA-3’; (8): Mfe-ITGB3-UP: 5'-CCGCAATTGGCCGCCATGGACC 
GAGCGCGGCCGCGGCCCCGGCCGCTCT-3’; (9): Xho I-ITGB3-DN: 5'- GCG 
CTCGAGTTAAGTGCCCCGGTACGTGATATTG-3’. Human integrin BgCD 
cDNA was cloned into pGEX4T-1 vector following digestion with BamHI and 
Xho I. Primer sequences used are: (1): [TGB8-UP: 5'-CGTGGATCC ATTAGACA 
GGTGATACTACAATGG-3’; (2): ITGB8-DN: 5’- GCGCTCGAGTTAGAAGT 
TGCACCTGAAAGTTTC-3’. GST-B3CD and recombinant Go3 purification 
was previously described*. Human THD cDNA, corresponding to N-terminal 
talin amino acid residues 1-433, was cloned into pcDNA3.1 vector and pMal- 
C2 vector between EcoR I and Xho I sites. Anti-RHOA antibody was purchased 
from Cytoskeleton; anti-Ga,3(sc410), anti-c-Sre (sc18), anti-talin (sc7534) and 
anti-integrin B3 (sc6627) antibodies were from Santa Cruz Biotechnology; anti- 
Ge.,3(26004) was from NewEast; anti-phospho-Sre Y*"° antibody was obtained 
from Cell Signaling; anti-talin (TA205) was from Millipore; anti-talin antibody 
8d4 (T3287) was obtained from Sigma; PAC] antibody (340507) and anti-mouse 
Oy, antibody MWReg3 (14-0411) were obtained from BD Biosciences; anti- 
human integrin B, antibody MAb15, LIBS6 and 8053 rabbit serum were kindly 
provided by M. Ginsberg (University of California, San Diego); Lipofectamine 
2000, viraPower lentivirus expression system, Alexa Fluor 546-conjugated phal- 
loidin, Alexa Fluor 488-conjugated anti-mouse secondary antibody, talin 1 shRNA 
plasmids (NM-011602), and non-specific shRNA control vector were from Invitrogen; 
Y-27632 is from Calbiochem; Fibrinogen from Enzyme Research Laboratories. 
Purified Ga,; and THD binding to integrin cytoplasmic domains. GST-tagged 
integrin cytoplasmic domain proteins were coated onto Pierce Glutathione-coated 
plates overnight at 4 °C. After washing twice with NP40 buffer (50 mM Tris, pH 7.4, 
10mM MgCl, 150mM NaCl, 1% NP-40, 1mM sodium orthovanadate, 1mM 
NaF) with complete protease inhibitor cocktail tablets (1 tablet per 5 ml buffer, 
Roche), purified THD or Ga, proteins were added onto the plate in NP40 buffer 
(for Gor 3 binding, buffer contained 30 uM AIF, ). Bound THD or Go; was estimated 
with anti-talin or anti-Go,3; antibody, horse radish peroxide (HRP)-conjugated 
secondary antibody, and 3,3’,5,5’-tetramethylbenzidine Substrates (Pierce). The 
wells were washed three times with NP40 buffer between each of these steps. The 
reactions were terminated with 1 M sulphuric acid and measured for OD 459 pm. For 
the competitive inhibition assay, increasing concentrations of THD or Gor,3 was 
added to the reactions. 

Platelet preparation. Studies using human blood were approved by the insti- 
tutional review board at the University of Illinois at Chicago, and informed con- 
sent was obtained from all donors. Washed human platelets from healthy donors 
who have not taken medication within 2 weeks before donation and platelets from 
8-12-week-old mice were prepared as described previously and re-suspended in 
modified Tyrode’s buffer’’. 

Platelet aggregation assay. Platelet aggregation and secretion were measured in a 
turbidometric platelet aggregometer (Chronolog) at 37 °C with stirring (1,000 
r.p.m.). Washed platelets (3 X 10° ml” ') in modified Tyrode’s buffer were stimu- 
lated with thrombin (Enzyme Research Laboratories). Aggregation traces shown 
are representative of at least three independent experiments. 

Fibrinogen and PACI1 binding assay. For the fibrinogen binding assay, washed 
human or mouse platelets resuspended in modified Tyrode’s buffer were incu- 
bated with 10 1g ml’ Oregon Green-conjugated fibrinogen (Molecular Probes) 
and PAR4AP as described previously”’. The reaction was diluted with PBS and 
analysed by flow cytometry using an Accuri C6 flow cytometry (BD Biosciences). 
PACI binding was measured with FITC-labelled PAC1 antibody (Molecular Probe). 


Co-immunoprecipitation. As described previously’, platelets or CHO-1b9 cells 
expressing recombinant integrin 0),}3;~ were solubilized in NP40 lysis buffer 
(50mM Tris, pH7.4, 10 mM MgCl, 150mM NaCl, 1% NP-40, 1mM sodium 
orthovanadate, 1mM NaF), with complete protease inhibitor cocktail tablets 
(1 tablet per 5 ml buffer, Roche). Lysis debris was cleared after centrifugation at 
14,000g for 10 min. Lysates were then immunoprecipitated with rabbit anti-Go,, 
IgG, anti-integrin B; rabbit serum or an equal amount of rabbit IgG or pre- 
immune serum for 2h before Protein A/G sepharose beads were added. After 
incubation of Protein A/G sepharose beads for 45 min at 4°C, beads were cen- 
trifuged down and washed for six times with NP40 lysis buffer. Immunopre- 
cipitates were analysed by immunoblotting. 

RHOA activity assay. Platelets or o%7,h3-expressing CHO cells in modified Tyrode’s 
buffer or adherent on immobilized fibrinogen were solubilized in cold NP40 lysis 
buffer at 4°C, and debris-cleared lysates were incubated for 1 h with purified GST- 
RBD beads, washed, and then immunoblotted with an anti-RhoA monoclonal 
antibody, as described previously*. 

Bone marrow transplantation. As described previously*, bone marrow stem cells 
were isolated from femur and tibias of 6-8-week-old integrin B; ‘~ or C57/BL6 
mice using the MACS lineage cell depletion kit (Miltenyi Biotec). Stem cells were 
subsequently infected twice with concentrated lenti-virus containing shRNA or 
cDNA constructs, as described in Animals and Reagents section, using a Lenti-X 
concentrator (Clontech). The cells were then retro-orbitally injected into irradiated 
recipient mice (5Gy for integrin B; ‘~ mice and 9.6Gy for C57/BL6 mice, one 
million cells per recipient mice) one day after irradiation. 

Platelet adhesion assay. As described previously~’, washed platelets were pre- 
incubated with vehicle or peptides, or with either 1mM MnCl, or 0.18 pg ml * 
LIBS6 before plating. After 1h incubation at 37 °C, adherent platelets were esti- 
mated by measuring platelet phosphatase activity with 0.3% p-nitrophenyl phos- 
phate in 1% Triton X-100, 50 mM sodium acetate, pH 5.0, for 1h at 37°C. The 
reaction was stopped with 1 M NaOH. Results were determined by reading OD 405 nm- 
Statistical significance was determined using t-test (n = 3). 

Cell spreading, immunofluorescence and confocal microscopy. Washed plate- 
lets or %,83-expressing CHO cells suspended in modified Tyrode’s buffer were 
added to 100 1g ml’ fibrinogen (Enzyme Research Laboratories)-coated cover slides 
and incubated at 37 °C for various lengths of time. Cells were fixed, permeabilized, 
blocked with 0.5% BSA in modified Tyrode’s buffer, stained with mAb15 (followed 
by Fluor 488-conjugated anti-mouse secondary antibody) and/or Alexa Fluor 546- 
conjugated phalloidin, and viewed with a Zeiss LSM510 META confocal micro- 
scope, as described previously*, or with Leica DM IRB fluorescence microscope, 
Photometrics CoolSNAP HQ camera and iManager software. Cell surface area 
was measured by NIH Image] analysis of 5-10 random images. Statistical signifi- 
cance was determined using t-test. 

Clot retraction assay. As previously described*, human PRP was incubated with 
vehicle or peptides for 5 min at room temperature (22 °C) before stimulation with 
thrombin. The two-dimensional size of retracted clots was quantified using Image 
J software, and statistical significance was determined using t-test (1 = 3). 
Peptide inhibitors. Myristoylated peptides were synthesized and purified at the 
Research Resource Center at the University of Illinois at Chicago. These peptides 
include: mP}3 (Myr-KFEEERARAKWDT), mP; (Myr-EEERA), mP. (Myr-FEEERA) 
and the corresponding control peptides mP,3Scr (Myr-EEARERKDWAKFT), 
mP.Scr (Myr-EEARE), and mP,Scr (Myr-ERAFEE). The peptides were prepared 
in DMSO for use in vitro, and in micellar formulation for in vivo (and in vitro) use. 
For micellar formulation, PEG999-DSPE, L-o-phosphatidylcholine, and peptides 
were mixed at a molar ratio of 45:5:2. The micelles were suspended to form micelle 
colloid in HEPES-saline buffer ((10 mM HEPES, 150 mM NaCl, pH 7.4), peptide 
concentration 1 mM) as described previously~*. mP< is similar to mP¢Scr in uptake 
by platelets (Extended Data Fig. 9a) and does not cause significant changes in 
hemogram in vivo (Extended Data Fig. 9b). 

Estimation of peptide concentration in platelets. mP, and mP,Scr peptides 
were dissolved and conjugated with PDAM overnight in the dark in DMSO, or 
conjugated in methanol and incorporated into the micelle as described above. 
Platelets were incubated with the PDAM-conjugated peptides for 5 min at room 
temperature, pelleted via centrifugation, washed and lysed with NP40 lysis buffer, 
and the concentration of PDAM-conjugated peptide was estimated by measuring 
fluorescence intensity (absorption 340 nm/emission 395 nm) as described prev- 
iously*’. Platelet lysates (without peptide incubation) were used as a blank control. 
Standard curve was obtained using known concentrations of peptides added to 
platelet lysates. 

In vivo FeCl;-induced thrombosis and tail bleeding time. 7-8-week-old 
C57BL/6 mice were anaesthetized by isoflurane inhalation. Retro-orbital injection 
of peptide micelle or integrilin (5 jzmol kg * mouse weight) were performed 15 min 
before experimentation. Carotid arterial thrombosis was induced with a filter 
paper disc (diameter = 2 mm) soaked with 1.2 pl of 7.5% FeCl;”*. Blood flow was 
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monitored with a TS420 flow meter using a MA-0.5SB dopler probe (Transonic 
Systems). Data were analysed using one-way ANOVA. Tail bleeding time analysis 
were performed as described previously’. Time to stable cessation of bleeding was 
defined as no evidence of rebleeding for 60s. Bleeding exceeding 15min was 
immediately stopped by applying pressure. Statistical significance was determined 
using the Mann-Whitney test. Similar results were also obtained with a nonpara- 
metric ANOVA. For bleeding assays measuring total blood loss, cut mouse tails 
were immersed in microcentrifuge tubes with 1.5 ml of 0.15 M NaCl at 37 °C for 
15min. The haemoglobin concentration in the tube was determined using a 
HemoCue photometer. Data were analysed using one-way ANOVA. The experi- 
ments were performed in double-blinded fashion. 

Intravital microscopy and laser-induced thrombosis. Similar to the methods 
described previously’’, wild-type male mice (6-8 weeks old) were anaesthetized 
via intraperitoneal injection of ketamine and xylazine and placed on a thermo- 
controlled blanket (37 °C). The cremaster muscle was exteriorized and superfused 
with thermo-controlled (37 °C) bicarbonate-buffered saline for the duration of 
experiments. Fluorescence and bright-field images were recorded using an Olympus 
BX61W microscope with a 60X/1.0 NA water immersion objective and a high- 
speed camera (Hamamatsu C9300) through an intensifier (Video Scope Inter- 
national). Fluorescence images were captured at 20 frames per second, and data 
were analysed using Slidebook v5.5 (Intelligent Imaging Innovations). Arteriolar 
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wall injury was induced with a micropoint laser ablation system (Photonics Instru- 
ments). Platelet accumulation was visualized by infusion of Dylight 649-labelled 
anti-mouse CD42c (Emfret, 0.05 gg * body weight) into mice. Vehicle control, 
Integrilin, scrambled peptide or mP¢ were infused 3 min before laser injury. Laser- 
induced thrombi were generated at different sites in the blood vessel, with new sites 
upstream of earlier thrombi. Data were collected for 5 min following laser injury. 
The kinetics of platelet accumulation was analysed by median fluorescence values 
of the antibodies as a function of time in approximately 30 thrombi in three mice 
per group. Statistical difference of fluorescence intensity (mean + s.d.) at selected 
time points was also determined using Welch’s t-test. The experiments were per- 
formed in double-blinded fashion. 

Statistics. For parametric data, statistical significance was analysed using Student’s 
t-test (or Welch t-test for samples with nonequal variances) or ANOVA following 
determination of normal distribution and equal variances. For nonparametric data 
(bleeding time analysis), Mann-Whitney test was applied. Analyses were per- 
formed with GraphPad Prism 4 software. Sample size estimation was performed 
with Fisher’s exact test using GraphPad InStat 3. 


30. Nimura,N., Kinoshita, T., Yoshida, T., Uetake, A. & Nakai, C. 1-Pyrenyldiazomethane 
as a fluorescent labeling reagent for liquid chromatographic determination of 
carboxylic acids. Anal. Chem. 60, 2067-2070 (1988). 
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Extended Data Figure 1 | A schematic showing how selective inhibitors of integrin outside-in signalling work as anti-thrombotics. Blue arrows indicate steps 


that are inhibited. 
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Extended Data Figure 2 | The importance of the conserved EXE motif in 
integrin-Ga,; interactions. a, b, Lysates from CHO cells expressing similar 
levels of wild-type (WT) o,83, Bs C-terminal truncation mutants (A759, A741, 
A728 and A715) complexed with wild-type op, (a), and B; EXE motif mutants 
(E731A, E732A, E733A and EEE to AAA) complexed with wild-type om 

(b) were immunoprecipitated with anti-Go,; antibody or equal amount of 
control rabbit IgG. Immunoprecipitates and lysates (equivalent of 10% used for 
immunoprecipitation) were immunoblotted with anti-Go,3 and anti-B; 
antibodies. c, d. GST-B3CD (WT) or GST-B3(AAA)CD (AAA mutant) 
proteins immobilized in glutathione-coated microtitre wells were incubated 
with increasing concentrations of Go3 (c), or increasing concentrations of 
THD (d). After washing, bound Ga; and THD were respectively detected 
using anti-Go,; or anti-talin (mouse IgG was used as a specificity control) 
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followed by secondary horseradish peroxidase-labelled anti-IgG antibody. e, In 
addition to the AAA mutation, conserved mutations of EEE to DED and EEE to 
QSE (as found in B;) were introduced to the 3 cytoplasmic domain. These 
mutants were co-transfected with wild-type oy, into CHO cells, which were 
sorted to achieve comparable expression levels with wild-type-o»3- 
expressing cells (as shown in Extended Data Fig. 4e). Lysates from these cells 
were immunoprecipitated with anti-§; or equal amount of pre-immune rabbit 
serum. Lysates (10%) and immunoprecipitates were immunoblotted with 
anti-Go.3 or anti-B3. f, Lysates from human platelets (with or without 
stimulation with 0.025 U ml’ thrombin) were immunoprecipitated with 
anti-Goa3 antibody or equal amount of control rabbit IgG. Immunoprecipitates 
were immunoblotted with anti-Ga,;3 and anti-B, antibodies. Go,3 is associated 
with B,, which is increased after thrombin stimulation. 
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Extended Data Figure 3 | Ligand occupancy induces switch of integrin 
Oyph3 from the talin-bound to the Ga,3-bound state a, To determine the 
effect of integrin activation and ligand occupancy on Ga,3-B3 association, 
human platelets were incubated with or without 1 mM MnCl, and 30 pg ml! 
fibrinogen for 5 min at 22 °C. Platelet lysates were then immunoprecipitated 
with anti-B; or pre-immune rabbit serum. Lysates (10%) and 
immunoprecipitates were immunoblotted with anti-B; or anti-Go,3. 

b, c, Washed human platelets were stimulated with 0.025 U ml! o-thrombin 
with or without adding 2 mM EDTA (an inhibitor of the ligand binding 
function of integrins), stirred (1,000 r.p.m.) at 37 °C, solubilized at various time 
points, and immunoprecipitated with anti-B; or equal amounts of pre-immune 
rabbit serum. Lysates (10%) and immunoprecipitates were immunoblotted 
with anti-Go,3, anti-talin or anti-f3 antibodies. b, Western blot results. 

c, Turbidity changes in platelet suspension indicating integrin-dependent 
platelet aggregation. Note the inhibitory effect of EDTA on talin dissociation 


and Ga; binding to B3. d, As additional controls for Fig. 2a to exclude the 
possibility of loss of talin and B; in platelet lysates to insoluble fraction during 
integrin signalling, washed human platelets were stimulated with 0.025 U ml — 
a.-thrombin in the absence or presence of 2 mM integrin inhibitor RGDS, 
stirred (1,000 r.p.m.) at 37 °C, and then solubilized at various time points as in 
Fig. 2a. Solubilized platelets were centrifuged at 14,000g for 10 min to separate 
lysates from insoluble pellets. Pellets were dissolved in SDS sample buffer to the 
same volume as the lysates after diluting them 1:1 with 2 SDS sample buffer, 
and both were immunoblotted with anti-f and anti-talin antibodies. Note that 
the levels of talin and f; in platelet lysates kept essentially constant during the 
course of platelet aggregation and, with low concentrations of thrombin used to 
stimulate platelets, very little insoluble B; and talin were present in the pellet, 
which were detectable only after prolonged exposure (5-min exposure 
compared to 10s of normal exposure time) and with no obvious variation 
during the course of platelet aggregation. 
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Extended Data Figure 4 | Effects of shRNA-induced talin knockdown and 
talin knockout on integrin signalling. a, Western blot comparison of talin 1 
expression levels in mouse platelets derived from control shRNA- or talin- 
shRNA-transfected bone marrow stem cells. Western blots of Ga,3, and 
integrin B, and B; are also shown. b, Adhesion of unstimulated mouse platelets 
to immobilized fibrinogen for 1h. Adherent platelets were quantified as 
percentage of total platelets loaded (mean + s.d.,n = 4). c, Turbidity changes in 
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mouse platelet suspension stimulated with 5 1M ADP in the presence of 

20 1g ml’ fibrinogen, with or without 1 mM MnCl, as detected using an 
aggregometer. d, Fluorescence microscopy images of phalloidin-stained mouse 
platelet spreading on fibrinogen for 1 h, with or without 1 mM MnCh. 

e, Quantification of surface areas of individual adherent platelets as shown in 
Fig. 2g (mean + s.e.m.). 
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Extended Data Figure 5 | Effects of AAA mutation on integrin outside-in 


signalling in platelets. a, Flow cytometric analysis of integrin op,83 expression 
levels in B; ‘~ mouse platelets transfected with wild-type or AAA mutant B; 


using bone marrow stem cell transplantation technology in comparison with 


C57BL/6 mouse platelets. B; /~ platelets were used as a negative control. o,{3 


complex was detected using an anti-mouse oy, antibody. b, Mouse platelets 
expressing recombinant wild-type or AAA mutant f as in a were lysed and 
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immunoprecipitated with anti-B; or equal amounts of pre-immune rabbit 
serum. Lysates (10%) and immunoprecipitates were immunoblotted with anti- 
SRC or anti-B; antibodies. c, d, Spreading of phalloidin-stained wild-type 
platelets (EEE), AAA mutant platelets and AAA mutant platelets incubated 
with mP,Scr or mP, on immobilized fibrinogen for 1 h. c, Typical fluorescence 
microscopy images. d, Quantification of surface areas of individual platelets 
(mean = s.e.m.). 
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Extended Data Figure 6 | Effects of mutational disruption of the EXE motif 
on integrin outside-in signalling. a, Expression levels of wild-type or the EXE 
motif (QSE, DED or AAA) mutants of 83 in complex with wild-type op, in 
CHO cells, as determined by flow cytometry. Mouse IgG was used as a negative 
control. b, c, Spreading of CHO-1b9 cells expressing wild-type 01,83, and QSE, 
DED or AAA mutant o47,f3 on fibrinogen for 1 h. b, Quantification of surface 
areas of individual cells (mean + s.e.m.). ¢, Typical microscopy images. d, Flow 
cytometric analysis of wild-type %,83, AAA or Y747A mutant on,83 
expression in CHO cells. Mouse IgG was used as a control. e, CHO cells 
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expressing wild-type, AAA or Y747A B; without (top panels) or with (bottom 
panels) co-expression of recombinant THD were solubilized and 
immunoprecipitated with anti-B; or pre-immune serum. 10% lysates and 
immunoprecipitates were immunoblotted with anti-talin, anti-Go.; or anti-B; 
antibodies. f, Typical western blots for Fig. 3g. Wild-type or AAA-mutant- 
Op B3-expressing CHO-1b9 cells were allowed to adhere to immobilized 
fibrinogen, solubilized at various time points, and analysed for RHOA 
activation and SRC Tyr 416 phosphorylation. 
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Fits 
Extended Data Figure 7 | mP, selectively inhibits integrin outside-in 
signalling without affecting inside-out signalling. a-d, Washed human 
platelets were stimulated with 0.025 U ml! a-thrombin in the absence or 
presence of 250 uM myristoylated peptides, mPj3 (a, b) and mP, (c¢, d) with 
stirring (1,000 r.p.m.) at 37 °C, and then solubilized at various time points. 
Lysates were immunoprecipitated with anti-B; rabbit serum or equal amounts 
of pre-immune serum. Lysates (10%) and immunoprecipitates were 
immunoblotted with anti-Go;3, anti-talin or anti-§3 antibodies. a, c, Typical 
western blot results. b, d, Typical turbidity changes in platelet suspension 
indicating integrin-dependent platelet aggregation. e, Quantification of human 
platelet spreading on immobilized fibrinogen for 1 h, without or with treatment 


w° ia! wo? a0? 
FLI-Height 
with DMSO, mP,Scr, or mP, as shown in Fig. 4b (mean surface area + s.e.m.). 
f, Flow cytometric analysis of PAR4-AP-induced Oregon Green-labelled 
soluble fibrinogen binding to human platelets pre-treated with 100 uM mP,Scr 
or 100 1M mPg stimulated with increasing concentrations of PAR4-AP. 
Integrilin-treated platelets were used as a negative control. g, Flow cytometric 
analysis of 100 1M PAR4-AP-induced PAC1 binding to human platelets pre- 
treated with 100 1M mP,Scr or mP.. Integrilin-treated platelets were used as 
negative control. h, Flow cytometric analysis of PAR4-AP-induced Oregon 
Green-labelled soluble fibrinogen binding to human platelets pre-treated with 
solvent DMSO, mP,;Scr or mP)3. Resting platelets were used as a negative 
control. 
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Extended Data Figure 8 | The in vivo effect of mP,: selective inhibition of 
thrombosis but not haemostasis. a, Representative images of laser-induced 
mouse cremaster arteriolar thrombosis (red) in the context of the bright-field 
microvascular histology, visualized by infusion of nonblocking rat anti-mouse 
GPIbf antibody conjugated to DyLight 649. The C57BL/6 mice were injected 
with 5 umol kg ! micellar formulated mP, or mP,Src (negative control), 

12 mol kg Integrillin or buffer, 3 min before laser-induced arteriolar wall 
injury. White arrows indicate the directions of the blood flow. b, The mean 
platelet fluorescence intensity for 30 thrombi (performed in three mice) for 


each treatment at selected time points (mean + s.e.m., n = 30, t-test). 
Fluorescence in mP¢- and Integrilin-treated mice is minimal. c, Comparison of 
mP, (5 umol kg” ') with the same dose of Integrilin and their respective 
controls in occlusion time of FeCl3-induced carotid artery thrombosis in mice. 
Typical arterial blood flow charts of FeCl,-induced occlusive thrombosis are 
shown. d, Comparison of mP, (5 umol kg” ') with the same dose of Integrilin 
and controls in mouse tail bleeding analysis. Released haemoglobin levels were 
used as a parameter to assess blood loss (mean = s.d., n = 10). 
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Extended Data Figure 9 | Platelet uptake of mP, and mP,Scr, and no effect 
of mP6 on hemogram. a, Estimation of intracellular levels of 
1-pyrenyldiazomethane (PDAM)-conjugated mP, and mP,Scr following 
incubation with platelets for 5 min. Platelets were pelleted by centrifugation, 
and the amounts of PDAM-conjugated peptides in platelet lysates were 
estimated (mean + s.d.,n = 3). b, Haemogram of mouse whole blood before or 
1h after injection of mP, or mP,Scr (5 umolkg '), showing no significant 
differences. 
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LARGE glycans on dystroglycan function as a tunable 
matrix scaffold to prevent dystrophy 


Matthew M. Goddeeris!”, Biming Wuy, David Venzke!?, Takako Yoshida- Moriguchi”, Fumiaki Saito*, Kiichiro Matsumura’, 


Steven A. Moore* & Kevin P. Campbell’'>° 


The dense glycan coat that surrounds every cell is essential for cellular 
development and physiological function’, and it is becoming appre- 
ciated that its composition is highly dynamic. Post-translational addi- 
tion of the polysaccharide repeating unit [-3-xylose-a1,3-glucuronic 
acid-f1-],, by like-acetylglucosaminyltransferase (LARGE) is required 
for the glycoprotein dystroglycan to function as a receptor for proteins 
in the extracellular matrix”’. Reductions in the amount of [-3-xylose- 
a1,3-glucuronic acid-f1-],, (hereafter referred to as LARGE-glycan) on 
dystroglycan result in heterogeneous forms of muscular dystrophy*. 
However, neither patient nor mouse studies has revealed a clear 
correlation between glycosylation status and phenotype™®. This dis- 
parity can be attributed to our lack of knowledge of the cellular 
function of the LARGE-glycan repeat. Here we show that coordi- 
nated upregulation of Large and dystroglycan in differentiating 
mouse muscle facilitates rapid extension of LARGE-glycan repeat 
chains. Using synthesized LARGE-glycan repeats we show a direct 
correlation between LARGE-glycan extension and its binding capa- 
city for extracellular matrix ligands. Blocking Large upregulation 
during muscle regeneration results in the synthesis of dystroglycan 
with minimal LARGE-glycan repeats in association with a less com- 
pact basement membrane, immature neuromuscular junctions and 
dysfunctional muscle predisposed to dystrophy. This was consistent 
with the finding that patients with increased clinical severity of 
disease have fewer LARGE-glycan repeats. Our results reveal that 
the LARGE-glycan of dystroglycan serves as a tunable extracellular 
matrix protein scaffold, the extension of which is required for nor- 
mal skeletal muscle function. 

LARGE is a dual-function glycosyltransferase that adds a glycan 
repeat to the basement membrane receptor dystroglycan (DG). DG is 
comprised of a transmembrane -subunit and a cell-surface-associated 
a-subunit—the LARGE-glycan is bound to the latter through a rare 
structure at the amino terminus of its mucin-like domain, a phosphory- 
lated O-mannosyl glycan’*. The LARGE-glycan binds, with high affi- 
nity, to laminin-G-domain-containing matrix proteins (including lami- 
nin, agrin, perlecan and neurexin® |”). These interactions can be directly 
competed by the antibody IIH6, which specifically recognizes the LARGE- 
glycan. The amount of LARGE-glycan repeat present on 4-DG has 
remarkable variability—both developmental’*"* and tissue-specific’’, 
as shown by marked differences in «-DG apparent molecular mass. 
Mutations in LARGE, the DG-encoding gene (DAG1), and several 
other genes involved in synthesizing the O-mannosyl glycan’* result 
ina group of muscular dystrophies termed dystroglycanopathies. These 
disorders encompass a broad spectrum of severity, ranging from late 
adult onset with predominant weakness of proximal muscle groups to 
more severe forms that present at birth and sometimes include defects 
in brain and eye development. Features common to these disorders are 
reduced ligand binding by «-DG and reduced to absent ITH6 immuno- 
reactivity. However, when a cohort of dystroglycanopathy biopsies 


was assessed for ITH6 immunoreactivity by microscopy a correlation 
between staining intensity and clinical severity was not found’. 

It has been proposed that «-DG has separable, currently unknown roles 
in developing and fully differentiated skeletal muscle**”*; this could explain 
some of the observed clinical heterogeneity among dystroglycanopathy 
patients. To address this possibility without interfering with «-DG eunng 
embryogenesis, we developed the GT(ROSA)26Sor'7"107 1/1210 RNAt Large 
(hereafter called Large”) inducible mouse model, in which Large is 
knocked down systemically through RNA interference (RNAi) (Extended 
Data Fig. 1a, b). On a standard diet, heterozygotes are phenotypically 
indistinguishable from littermates and «-DG glycosylation is normal 
(Extended Data Fig. 1c, d). When doxycycline is introduced through the 
diet, however, Large transcription is downregulated and ITH6 immuno- 
reactivity in skeletal muscle and elsewhere is lost (Fig. 1a, b). Skeletal 
muscle has an extraordinary capacity for regeneration, a process that 
involves triggering satellite cell differentiation into myoblasts, which 
then proliferate and undergo myogenesis. We designed an experiment 
to test the impact of disrupted «-DG on regeneration, comparing muscle 
formed with normal versus defective «-DG. Specifically, we used Naja 
nigricollis cardiotoxin snake venom (CTX) to induce regeneration in 
select muscles of otherwise healthy adult Large“? mice, and simulta- 
neously began Large knockdown (Fig. 1c). Although §-DG was preserved 
in Large*” muscle through 3 months after induction, LARGE-glycan was 
nearly eliminated (Fig. 1d). Gross analysis and quantification of mul- 
tiple parameters in muscle from CTX-injected Large“? mice and unin- 
jected Large“? and littermate controls (both CTX and uninjected) 
revealed that the test mice were profoundly dystrophic (Fig. 1d-g). 
Some uninjected Large” muscle myofibres featured centrally loca- 
lized nuclei, indicating that the myofibre had previously regenerated, 
but evidence of active necrosis or regeneration (@MHC+ myofibres) 
was minimal despite the absence of LARGE-glycosylated «-DG. From 
these results we proposed that the degree to which a patient mutation 
affects «-DG LARGE glycosylation during muscle formation is a major 
determinant of disease severity. 

Physiological assessment at earlier time points confirmed that LARGE- 
glycan is critical for muscle regeneration. Bilaterally CTX-injected LargeX? 
mice had significant deficits in downhill running within 3 weeks (Fig. 2a), 
whereas uninjected mice maintained running capacity in spite of a 
significant reduction in normally glycosylated %-DG (Extended Data 
Fig. 2a). Notably, Large“? muscles recovering from CTX injury had 
abundant ITH6-positive «-DG, although of a reduced molecular mass 
(Fig. 2b and Extended Data Fig. 2a, b). Despite its low molecular mass, 
this «-DG was competent to bind laminin (Fig. 2b) and localized to the 
sarcolemma correctly (Fig. 2c). These results indicated that the number 
of «-DG LARGE-glycan repeats is critical to the cellular function of 
a-DG during muscle regeneration. 

Histological examination of regenerating Large” muscles early after 
CTX treatment revealed that myofibres regenerate without a delay, but 
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adipose cells, a feature commonly seen in severe muscular dystrophies, 
were observed (Extended Data Fig. 2c, d). Staining for perlecan, which 
normally co-localizes with laminin in the basement membrane, was 
reduced in intensity at the basement membrane but was also found 
mislocalized to the collagen-rich, extracellular matrix layer known as 
the endomysium (Extended Data Fig. 3a—e). Laminin «2 was similarly 
mislocalized to the endomysium (Extended Data Fig. 3f). Given that 
these findings were indicative of basement membrane abnormalities, 
we used transmission electron microscopy to probe for defects at 
the ultrastructural level. The presence of centrally localized nuclei, 
an increase in the presence of fibroblasts and macrophages, and an 
increase in the number of mitochondria at the myofibre periphery 
confirmed that the regions imaged had been damaged by CTX. In 
CTX-injected Large” muscle, the basement membrane was signifi- 
cantly thickened and, unlike that in CTX control muscles, was often 
composed of multiple layers (Fig. 2d and Extended Data Fig. 3g). In 
addition to these abnormalities, collagen fibrils in the endomysial space 
were unusually abundant and often abnormally oriented with respect 
to the myofibres, consistent with the fibrous appearance of collagen 
structures by immunofluorescence (Extended Data Fig. 3a). The expres- 
sion of collagen VI, perlecan and agrin was unaltered in CT X-injected 
Large” muscles, although laminin isoforms were found to be elevated 
at both the mRNA and protein levels (Extended Data Fig. 4 and Fig. 2b). 
The observed layering of the basement membrane and the increase in 
thickness is consistent with reduced compaction of laminin and the 
collagen superstructures during basement membrane formation. 


d, Representative images of IIH6 and B-DG 
reactivity in immunofluorescence- and 
haematoxylin-and-eosin (H&E)-stained sections 
(black arrows, centrally nucleated myofibres). 
e-g, Tibialis anterior muscle sections (4 sections/ 
muscle) of control and Large®? aoe (white and 
black bars, respectively, Large“” mouse, n = 3; 
control, n = 4 biological replicates) were assessed 
for: e, fibre diameter (interaction P < 0.001, 
~2,000 fibres per group; diamonds represent mean 
fibre diameter); f, average regions of necrosis 
(interaction P = 0.057); and g, average number of 
recently regenerated, embryonic myosin-positive 
2 fibres (interaction P = 0.007). Error bars represent 
s.d. in e, and s.e.m. elsewhere. Post-hoc 
comparisons *P < 0.05, ** P< 0.0001; NS, not 
significant; ND, not detected. Scale bars, 100 um. 
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We next assessed the mechanism that underlies the variability in the 
molecular mass of «-DG. In regenerating control mice, «-DG shifts 
from approximately 100 kDa to 156 kDa (mode value) within 10 days 
of injury (Fig. 3a). This shift is closely paralleled by increases in the 
expression of basement membrane components laminin and collagen 
VI (Extended Data Fig. 5a), and by other biochemical hallmarks of 
regeneration such as a gradual loss of embryonic myosin and restora- 
tion of dihydropyridine receptor expression (Fig. 3a). To investigate 
such changes in greater detail we used the C2C12 mouse myoblast cell 
line, a classical model of myogenesis, which shows a marked increase 
in DG protein levels upon serum restriction’’. The transition to larger 
glycoforms occurred rapidly after myoblasts are induced to differenti- 
ate, preceding myoblast fusion into myotubes on differentiation day 
(DD) 2 by 24h (Fig. 3b and Extended Data Fig. 5b). The modal «-DG 
glycoform increased by 18kDa over the course of the experiment 
(Extended Data Fig. 5c). This change correlated with rapid and sus- 
tained increases in Large and Dagl expression (Fig. 3c). Other modi- 
fiers of «-DG also showed increased expression levels but later and/or 
less acutely (Extended Data Fig. 6). We confirmed that other forms of 
a-DG glycosylation were not responsible for the increase in molecular 
mass, through enzymatic de-glycosylation removing the N-glycans, cer- 
tain o-DG mucin O-glycans and terminal trisaccharides of O-mannosyl 
tetrasaccharides (a treatment that spares the phosphorylated O-mannosyl 
linked LARGE-glycan’). Notably, although both DDO and DD4 «-DG 
migrated faster after de-glycosylation, the difference in molecular mass 
between the samples remained, as did the glycoform variation (band 
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smearing) within each sample (Fig. 3d). We conclude that both inter- 
and intra-o%-DG molecular mass variability during myogenesis is due 
to differences in the quantity of LARGE-glycan repeats. Next we mea- 
sured inorganic phosphate chemically released from «-DG isolated 
from C2C12 DD1 (myoblasts) and DD5 (myotubes) after acid hydro- 
lysis in conjunction with the malachite green phosphate assay. We found 
no significant difference (6.15 + 2.37 LM versus 6.83 + 2.19 uM phos- 
phate released per unit DG for DD1 and DDS, respectively, n = 3 
trials). Thus, the increase in a-DG LARGE-glycan content during 
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Figure 2 | Regenerating Large” muscle has 
a-DG of reduced molecular mass and basement 
membrane defects. a, Quantification of muscle 
performance in downhill treadmill assay. Mice 
were injected bilaterally in the tibialis anterior and 
gastrocnemius muscles, and were given a 21-day 
recovery period. Grey line indicates predetermined 
assay end point (performance averages, interaction 
between genotype and CTX injury P = 0.015, post- 
hoc comparisons *P < 0.05, ** P< 0.0001, error 
bars indicate s.e.m., n = biological replicate 
number for trial depicted, 3 experimental 
replicates). b, Western blot analysis and laminin 
overlay assay prepared from regenerating muscle, 
each lane representing an individual animal. CL, 
contralateral (un-injured) muscle; *n.L, native 
laminin. c, IIH6 and B-DG immunoreactivity of 
muscle sections. d, Muscle ultrastructure as 
assessed by transmission electron microscopy. 
White arrowhead, sarcolemma; black arrows, 
basement membrane; CB, collagen bundles. 
Samples in b-d were taken 10 days after CTX 
injury; scale bars, 50 um (c) and 0.5 um (d). 
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We proposed that a reduction in the LARGE-glycan degree of poly- 


merization results in «-DG with reduced ligand-binding capacity. 
Using synthesized LARGE-glycan chains of either low (up to 13 
repeats) or high molecular mass (>13 repeats) immobilized on ELISA 
assay plates, we performed binding assays for laminin, ITH6 and the 
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perlecan V domain (Fig. 3e, fand Extended Data Fig. 7a—c). Binding to 


Figure 3 | Increase in a-DG LARGE-glycan 
chain length during myogenesis enhances 
ligand-binding capacity. a, Western blot analysis 
of WGA-enriched samples for expression of DG 
and other proteins at multiple times after CTX- 
mediated injury to C57BL6/J mice, each lane 
representing an individual animal. b, Western blot 
analysis of WGA-enriched lysates from C2C12 
myoblasts undergoing myogenesis for functional 
a-DG (DD, differentiation day). At DD4-5 the 
loading control, B-actin, is downregulated in 
myotubes. c, qPCR analysis of Dagl and Large 
during C2C12 myogenesis (averages reported, 
P<0.001, n = 3 biological replicates, n = 7 
technical replicates, relative to Rp/4 expression, 
error bars indicate s.e.m.). d, Western blot 
demonstrating effects of enzymatic de- 
glycosylation of DDO and DD4 C2C12 protein 
samples. e, f, Elution profile of LARGE-glycan 
repeats on a gel filtration column. Bars indicate the 
fractions that were collected as the high- and low- 
molecular-mass LARGE repeats (S, substrate) used 
for solid-phase laminin 111 (f, high molecular mass 
K, = 8.2) binding. g, DG western blot in DD4 
myotubes from Large“? (+1 pg ml * doxycycline) 
and control cultures. h, Solid-phase laminin 111 
binding demonstrating that binding capacity is 
dependent on extension of the LARGE-glycan 
(control Ka = 2.93 = 0.776 nM; Large®? 

Kqa= 4.75 + 1.24nM). i, Comparison of solid- 
phase determined relative B,,,, values for several 
a-DG ligands (averages represented, By... Values 
for Large“? were set to 1 to allow for direct 
comparisons; error bars indicate s.e.m.). For solid- 
phase assays (f and h), error bars indicate s.e.m., 
n = 3 technical replicates, curve fitting to equation 


f= Bmax*abs(x)/(Ka + abs(x)). 


the low-molecular-mass LARGE-glycan was limited compared to that 
for the high-molecular-mass LARGE-glycan. Next, myotubes gener- 
ated from primary cultures of isolated control and Large*” satellite 
cells were collected for «%-DG solid-phase binding assays (Fig. 3g). The 
binding capacity (relative Bmax) of a-DG from the Large” myotubes 
was reduced for multiple ligands, including recombinant «1LG4-5 
(2.04-fold), the perlecan V domain (2.47-fold), and laminin 111 (3.55- 
fold); the latter also displayed a small reduction in dissociation con- 
stant (Kg) (Fig. 3h and Extended Data Fig. 7d-f, summarized in Fig. 3i). 
These data demonstrate that the degree of LARGE-glycan polymeriza- 
tion, and thus the molecular mass of «-DG, correlates directly with the 
ligand-binding capacity of «-DG. 

We reasoned that extension of the LARGE-glycan repeat chain 
during regeneration increased ligand-binding capacity by providing 
a larger scaffold for basement membrane components. The neuromus- 
cular junction (NMJ) contains a basement membrane especially rich in 
a-DG ligands. According to our hypothesis, this structure should be 
disrupted in Large” mice. Analysis of mice by tail suspension was con- 
sistent with such NMJ abnormalities in regenerated muscles; Large<” 
mice subjected to CTX injections into the tibialis anterior and gastro- 
cnemius muscles clasped their hind paws instead of splaying them 
outward as was typical for control mice (Fig. 4a). Furthermore, analysis 
of acetylcholine receptor (AChR) labelling in these mice 21 days after 
CTX damage revealed that the NMJs were fragmented and irregular in 
shape (Fig. 4b, c), consistent with impaired maturation at the ultra- 
structural level. Finally, in AChR aggregation assays in primary myo- 
tube cultures, the addition of agrin or agrin plus laminin 111 to the 
medium of control myotubes led to a stepwise increase in AChR aggre- 
gate number and size, as previously demonstrated”, but these increases 
were absent in Large“” myotubes (Fig. 4d, e). These findings demon- 
strate that normal NMJ maturation requires a high degree of LARGE- 
glycan polymerization, and that in the absence of such extension ligand 
saturation occurs. 

Our data demonstrate that LARGE-glycan extension is required for 
reformation of both normal NMJs and the basement membrane after 
injury. Moreover, they indicate that the degree of LARGE-glycan poly- 
merization is temporally coordinated with muscle regeneration. We 
thus tested whether the presence of excess LARGE-glycan early during 
regeneration would be detrimental. The overexpression of Large (in 
CAG-Large transgenic mice) resulted in hyper-glycosylated muscle 
a-DG, both at steady state (muscles not subjected to CTX injury) and 
shortly after CTX-induced muscle injury (Extended Data Fig. 8a, b). 
Glycosidase treatment of wheat-germ-agglutinin-enriched samples indi- 
cated that off-target activity owing to Large overexpression was unlikely 
(Extended Data Fig. 8c, d), although we cannot entirely exclude a small 
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degree of nonspecific LARGE activity. Regenerated CAG-Large myo- 
fibres were reduced in diameter and cross-sectional area, and myofibre 
density was increased, indicative of a defect in myoblast fusion (Extended 
Data Fig. 8e-i). These data support the notion that the pace of increased 
LARGE-glycan polymerization is an important aspect of «-DG func- 
tion during muscle regeneration. 

We next assessed human muscle biopsies from several forms of 
muscular dystrophy. Samples from two broad classes of muscular dys- 
trophy were examined: more severe congenital forms (early-childhood 
onset, impact on non-muscle organs; congenital muscular dystrophy 
(CMD)), anda less severe form (adult-onset, impacting proximal mus- 
cles to a greater extent; limb girdle muscular dystrophy 21 (LGMD2))). 
We observed a reduction in the molecular mass of ITH6-positive x-DG 
in the congenital muscular dystrophy muscle only (Extended Data 
Fig. 9a); whereas very little IIH6-positive o-DG was observed in the 
LGMD2I samples, the residual protein present was of typical molecular 
mass and restricted (as assessed by immunofluorescence) to muscle 
regions undergoing regeneration (Extended Data Fig. 9b). These results 
are in accordance with a previous study’’. Consistent with our findings 
from Large“? mice, mislocalization of perlecan to the endomysial 
space was observed only in biopsies where «-DG was of reduced 
molecular mass (Extended Data Fig. 9c, d). Despite the fact that endo- 
mysial fibrosis in the LGMD2I biopsy was extensive, perlecan locali- 
zation was normal, that is, restricted to the basement membrane. These 
data suggest that the typical late onset of LGMD21 is caused by loss of 
functional %-DG in the context of normal basement membrane forma- 
tion. A previous analysis demonstrated that IIH6-positive «-DG could 
be recovered with LARGE overexpression in dystroglycanopathy cell 
lines’’. In light of our results, we propose that in some dystroglycano- 
pathies the native increase in LARGE function during myogenesis 
provides sufficient LARGE-glycan during muscle regeneration to lessen 
the impact of certain disease-causing mutations. If true, this would mean 
that the degree to which LARGE-glycan polymerization is reduced 
during muscle regeneration is a critical determinant of clinical severity. 

Our findings demonstrate that the LARGE-glycan repeat of x-DG is 
a tunable scaffold for extracellular matrix proteins, the restriction of 
which through Large knockdown during muscle regeneration both 
reduces muscle physiological function and predisposes it to dystrophy. 
Whereas the lower molecular mass «%-DG retains the ability to bind 
laminin, our data are consistent with it binding fewer ligands per «-DG 
protein. In addition to binding the laminin network of the basement 
membrane, the LARGE-glycan also binds agrin and perlecan, which 
are capable of binding laminin and collagen concurrently. We specu- 
late that when sufficient LARGE-glycan is available, these collateral 
linkages facilitate compaction of the basement membrane layers 


Figure 4 | Reduction in a-DG LARGE-glycan 
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(Extended Data Fig. 10a, b). We also note that basement membranes 
from different tissues vary in both thickness and elasticity”, and that 
a-DG is differentially glycosylated in a manner that affects ligand- 
binding characteristics (due mainly to differences in LARGE-glycan) 
across many tissues'>*°(Extended Data Fig. 10c, d). It is thus possible 
that the capacity to fine-tune extension of the «-DG LARGE-glycan 
contributes to diversity in basement membrane structures across 
mammalian tissues. 


METHODS SUMMARY 

All mouse studies were performed on adult (8-15-week-old) mice raised on a 
standard mouse diet. 10 1M CTX (Accurate Chem & Sci Co in phosphate buffered 
saline) was used for intramuscular injections, at the following volumes: tibialis 
anterior, 25 1l; gastrocnemius, 50 1l. Injections of phosphate buffered saline alone 
did not cause observable muscle damage. Injections were unilateral unless other- 
wise noted. Mouse and human protein samples were processed similarly after 1% 
Triton X-100 solubilization of physically disrupted tissue. Glycoprotein enrich- 
ment via wheat-germ agglutinin (WGA) agarose beads (Vector Labs) was per- 
formed as described previously’. Induction of Large knockdown was achieved by 
replacing the normal mouse diet with equivalent chow containing 1 gkg™' dox- 
ycycline (Bio-Serv) at the time of CTX injection. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Experimental replicates. Unless otherwise stated, all experiments were repeated 
in the laboratory three times. Data reported are representative. 

Animals. Animal care, ethical usage and procedures were approved and per- 
formed in accordance with the standards set forth by the National Institutes of 
Health and the Animal Care Use and Review Committee at the University of Iowa. 
At the University of Iowa all mice are socially housed (unless single housing is 
required) under specific pathogen-free conditions in an AAALAC accredited 
animal facility. Housing conditions are as specified in the Guide for the Care 
and Use of Laboratory Animals (NRC). Mice are housed on Thoren brand, HEPA 
filtered ventilated racks, in solid bottom cages with mixed paper bedding. A standard 
12/12-h light/dark cycle was used. Standard rodent chow (or special diet if required) 
and water is available ad libitum. C57BL6/J and Large” mice were obtained from 
The Jackson Laboratory. Large” mice were generated at TaconicArtemis GmbH. 
CAG-Large transgenics were generated by ligating a 1.7-kb fragment of the CAG 
promoter to a 2.3-kb fragment corresponding to positions + 174 through +2444 of 
the Large (NM_010687) cDNA, plus the rabbit B-globin polyadenylation signal. 
The linearized 4.6-kb Sall-Stul CAG-Large construct was injected into pronuclei of 
fertilized zygotes from C57BL/6NTac mice and transferred to pseudopregnant 
females. Offspring were screened for genomic integration of this fragment by 
PCR of tail DNA, using the following CAG-Large-specific primers (PCR product 
size of 2.5 kb): forward 5’-CCTACAGCTCCTGGGCAACGTGCTGGTT-3’, reverse 
5'-AGAGGGAAAAAGATCTCAGTGGTAT-3’. Mice were generated by breed- 
ing F, heterozygous transgenic males to wild-type females. 

Mice were bred onto a C57BL6/J background (backcross 6 or greater). All 
mouse studies were performed on adult (8-15-week-old) mice raised on a stand- 
ard mouse diet. Whereas within each experiment all subjects were sex- and age- 
matched (littermates where possible), both male and female mice were used and 
found to respond similarly with the exception of the downhill running assessment 
(Fig. 2a) wherein only male mice were used as performance variability was noted 
between the sexes. Previous experience with standard deviation of given tech- 
niques and pilot studies using naive animals were used to determine required 
power for experiments where appropriate with all efforts made to ensure animals 
for a given experiment were littermates or within 1 week of age. All treatment 
group designations (randomization) were assigned based on mice identification 
number and genotype information before experimenter’s observation of the ani- 
mals to exclude any bias based on animal appearance. 10 uM CTX (Accurate 
Chem & Sci Co) in phosphate buffered saline (PBS) was used for intramuscular 
injections, at the following volumes: tibialis anterior, 25 ul; gastrocnemius, 50 pl. 
Injections of PBS alone caused no observable muscle damage. Injections were 
unilateral unless otherwise noted. Induction of knockdown was achieved by repla- 
cing the normal mouse diet with an equivalent chow containing 1 gkg_* doxycy- 
cline (Bio-Serv) at the time of CTX injection (day 0). Animals were exercised by 
downhill running (15° grade), using a variable speed belt treadmill (Omnipacer, 
AccuScan Instruments, Inc.) by an experimenter blinded to the genotypes of 
individual animals. Warm-up: 5 min, 3 metres per min (m.p.m.); running at the 
following paces; 10 m.p.m., 15 m.p.m., 20 m.p.m. for 5 min each, and 25 m.p.m. 
for 10 min or to exhaustion. Exhaustion was scored as 10 consecutive seconds of 
non-performance. 

Human subjects and samples. Muscle biopsies were originally collected for dia- 
gnostic purposes and were obtained and tested according to the guidelines set out 
by the Human Subjects Institutional Review Board of the University of Iowa; 
informed consent was obtained from all subjects or their legal guardians. 

Cell culture. C2C12 mouse myoblasts (freshly purchased from ATCC) were 
maintained in DMEM containing 10% FBS, 1% L-glutamine at 37°C in 5% 
CO,, and kept to a maximum of eight passages. For differentiation, cells were 
grown to confluence (designated differentiation day 0, DDO) and then switched 
to DMEM with 2% donor equine serum. AraC (cytosine f-b-arabinofuranoside 
hydrochloride, 10 nM final, Sigma) was added to differentiating C2C12 cells after 
myotubes formed, to limit the proliferation of differentiation-incompetent myo- 
blasts. Satellite cell isolation was performed as per ref. 31. Satellite cells were plated 
on BD Matrigel-coated dishes and activated to differentiate into myoblasts in 
DMEM-F12, 20% fetal bovine serum (FBS), 40 ng ml! basic fibroblast growth 
factor (R&D Systems, 233-FB/CF), 1X non-essential amino acids, 0.14mM 
B-mercaptoethanol, 1X penicillin/streptomycin and Fungizone. Myoblasts were 
maintained with 10 ng ml * basic fibroblast growth factor and differentiated in 
DMEM-F12, 2% FBS, 1X insulin-transferrin-selenium. For AChR-aggregation 
assays, primary myoblasts were seeded on BD Matrigel-coated glass coverslips. 
Muscle analysis. Immunofluorescence and haematoxylin and eosin staining were 
carried out as described previously’. All images are representative from larger 
analysis of at least three animals per group, per condition. Fibre size diameter 
(minimal Feret’s diameter) was assessed using ImageJ software (version 1.45s), 
based on S-DG immunofluorescence. Necrosis was assessed using anti-mouse IgG 
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fluorescent antibody (detects immune-cell infiltration of degenerating fibres). 
qRT-PCR for mouse Dag! and associated glycosyltransferases was performed 
using a BioRad MyIQ system, with protocols and primer sets previously described”. 
The following primer sets were also used: Ispd1 forward 5'-TGGTGTGGATTA 
GGGGGTTA-3’, reverse 5’-TGGCTGCACTTTGTCCTAAA-3’; Tmem5 forward 
5'-GAGAACAGTGGCAGCCTCA-3’, reverse 5’-CAAAGGAGCAGGCCTCAT 
AG-3'; Sgk196 forward 5'-GCTGTCCTGTGAAGAGCTGA-3’, reverse 5’-GGG 
AGAGAGCGACTTTGTGT-3’; Ignt1 forward 5'-ACATTTGACGAACGCTTTC-3’, 
reverse 5'-CCTCCTTTTGGGGATGGAAC-3’; Itga7 forward 5'-TTGCTGTT 
AGCCACGATCAG-3’, reverse 5’-CGCCAGAGAAGAAGAGTTGC-3’; Col6a 
forward 5'-CTCTCCTGGTTCACCCATGT-3’, reverse 5'-CCCGACTCTACCG 
AGATTGA-3'; Lama2 forward 5'-CCAAGAAGGAGGCTGCATAG-3’, reverse 
5'-CCAGGTGTTGGGAAGACACT-3’; Lamb1 forward 5'-GTTCGAGGGAAC 
TGCTTCTG-3’, reverse 5'-GTTCAGGCCTTTGGTGTTGT-3’; perlecan (Hspg2) 
forward 5'-GAGCGGACTGTACCTTGGTC-3’, reverse 5'-ACCAGTTGCACA 
CAGCTCAC-3’; agrin forward 5'-CAGTGGGGGACCTAGAAACA-3’, reverse 
5'-ACCTTTCCAATCCACAGCAC-3’. 

Fold changes were calculated using the AACt method. Failed reactions, those 
with Ct values within 1 Ct value of water control, were excluded. Mouse and 
human protein samples were obtained by 1% Triton X-100 solubilization of phys- 
ically disrupted tissue. Glycoprotein enrichment via WGA agarose beads (Vector 
Labs) was performed as described previously’. Densitometry was performed using 
LiCor Odyssey Software, V3.0, of triplicate blots. Laminin overlay and solid-phase 
assays were performed as previously described*’. For analysis of NMJs, Alexa-488- 
conjugated o.-bungarotoxin was used to stain 4% paraformaldehyde-fixed tibialis 
anterior muscle cut into thirds longitudinally. Labelled muscle was cleared using 
glycerol and flattened. Maximal-intensity projections were obtained using an 
FV1000 Olympus Scanning Confocal laser microscope and z-stack sections com- 
piled with FluoViewer-1.5 (Olympus). Next, individual NMJs from each image 
were assigned a number, cropped and copied onto a scoring PowerPoint file in a 
randomized fashion before scoring. Individual NMJs were scored by three blinded 
observers, using criteria established previously*’. NMJs were scored as ‘fragmen- 
ted’ if they were comprised of 5 or more AChR islands, and ‘irregular” if they were 
abnormally shaped (either shallow folds and involutions or none at all). NMJs 
throughout the muscle were sampled, from 3 animals per group. CAG-Large mice 
were analysed for fibre size diameter (minimal Feret’s), fibre cross-sectional area 
and density from images acquired using a VS120-S5-FL slide scanner microscope 
(Olympus) with VS-ASW (version 2.6). Analysis was carried out using VS- 
Desktop software version 2.6. 

Statistics. All experimental data are representative of repeated experiments. 
Statistical analysis was performed using SigmaPlot software which determines if 
testing assumptions are met (Shapiro—Wilk normality and equal variance testing). 
Figure la used the Student’s t-test, two sided. Figures le, f, 2a and 4c-e were 
analysed by two-way ANOVA. Post-hoc, pairwise multiple comparison proce- 
dures used the Holm-Sidak method with correction for multiple testing. Figure 3c 
data were analysed by one-way ANOVA on ranks. 

Malachite green phosphate assay and sample preparation. C2C12 cells were 
differentiated on three 15-cm dishes and collected by physical scrapping. Cells 
were homogenized with 1% Triton X-100 by vortexing for 5 min, and insoluble 
material was centrifuged and discarded. An initial Lowry protein assay determined 
that starting protein was equivalent in each sample. DG pull-down was carried 
out using a B-DG-specific monoclonal antibody (8D5, Developmental Studies 
Hybridoma Bank) pre-bound to protein-A agarose beads (Santa Cruz Biotech- 
nology). After extensive washing with 0.1% Triton X-100 in Tris-buffered saline, 
DG and the associated proteins were eluted using 0.1 M B-DG peptide. Samples 
were then dialysed exhaustively in ddHO (6,000-8,000 molecular mass cutoff, 
5 days, 12+ 41 changes). Concentrations relative to that of the x-DG core protein 
were determined by ELISA. Equal starting amounts of «-DG were then treated 
with 150 units calf intestinal alkaline phosphatase (CIP, New England Biolabs) for 
30 min at 37 °C, to remove organic phosphate modifications such as the O-linked 
N,N'-diacetyllactosamine modification recently described for «-DG*. To release 
phosphate associated with the LARGE-glycan, samples were incubated with ice- 
cold 48% aqueous hydrofluoric acid at 0°C for 20h. The reagent was removed 
under a steady stream of nitrogen gas while on ice. Control samples were prepared 
by the same procedure, except that ice-cold water was used in place of hydrofluoric 
acid. Re-suspended samples were then assessed for phosphate content using the 
Malachite Green Phosphate Detection kit (R&D Systems). 

LARGE-glycan preparation for the binding assays. 0.2 mM biotinylated GlcA- 
MU (custom synthesis by Sussex Research Laboratories Inc.) was incubated with 
LARGEdTM’ 10mM UDP-glucuronic acid (GlcA), 10 mM UDP-xylose (Xyl), 
10mM MgCl and 10 mM MnCl, in 100 mM 2-(N-morpholino)ethanesulphonic 
acid (MES) buffer (pH 6.5) at 37 °C for 18h. The enzymatic reaction was termi- 
nated by boiling in the presence of 50 mM EDTA. After centrifugation (20,000g for 
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10 min), the supernatant was fractionated using gel-filtration chromatography 
(Superdex peptide 10/300GL, GE Healthcare) as described previously’. The peaks 
eluted from retention time 75-97 min, and from 97-170 min were pooled as low- 
molecular-mass and high-molecular-mass LARGE repeats, respectively, and then 
lyophilized. The LARGE repeats in these two pools were dissolved in water and 
then analysed using a Bruker UltrafleXtreme MALDI-TOF/TOF mass spectro- 
meter in negative reflection mode. «-Cyano-4-hydroxycinnamic acid (saturated in 
a mixture of acetonitrile and 0.1% TFA in 1:1 v/v) was used as the matrix as 
described’. To measure the concentration of MU-linked glycan, the glycan sam- 
ples and known amounts of GlcA-MU were hydrolysed in 1 M of HCl at 95 °C for 
20 min. The amounts of 4-methylumbelliferone released by acid hydrolysis were 
measured using a microplate reader Synergy4 (BioTek) (excitation, 350 nm; emis- 
sion, 450nm). GlcA-MU and low- and high-molecular-mass LARGE repeats 
(0.08 nmol per well) were immobilized on a NeutraAvidin coated 96-well plate 
(Pierce) according to the manufacturer's protocol. The plates were used for the 
solid-phase binding assays as described below. 

Solid-phase binding assays. Myotube culture samples were collected by scraping 
and homogenized as described above. Total protein was quantified by Lowry 
protein assay, and equal amounts of protein were used to enrich for glycoproteins, 
using WGA agarose beads (Vector Labs) as described previously’. The use of 
equivalent amounts of DG was confirmed by ELISA to «-DG core protein and 
western blotting for B-DG. Samples were diluted with Tris-buffered saline 1:50, 
and bound to Costar 96-well, High Bind EIA/RIA plates (50 il per well) overnight 
at 4°C. Unbound sample was carefully aspirated and the plate blocked with 3% 
bovine serum albumin in laminin binding buffer (LBB; 746 mM triethanolamine, 
140 mM NaCl, 1 mM CaCh, 1 mM MgCh, pH 7.6) for 2h at room temperature. A 
series of ligand dilutions was prepared in LBB with a final concentration of 3% 
bovine serum albumin and 2 mM CaCl. Laminin 111 (Invitrogen), the perlecan V 
domain*’, the «1 LG4-5 domain (isolated from conditioned medium from HEK293 
cells secreting Flag-tagged «1LG4-5 domain, collected and enriched as described**), 
and AEBSF-laminin 111 were used in separate binding assays. To generate AEBSF- 
laminin 111, 10 mM serine protease inhibitor p-aminoethylbenzenesulphony] flu- 
oride AEBSF, HCl (Sigma) was incubated overnight on ice with 1 mg mouse laminin 
protein (Invitrogen) and used to inactivate laminin polymerization. AEBSF-treated 
laminin was dialysed to remove free AEBSF with 5 LBB changes of one litre each. 
Binding solutions were incubated with bound sample for 1 h at room temperature. 
Mixtures were aspirated completely and washed three times with 200 pl of 1% 
bovine serum albumin in LBB (wash solutions were aspirated each time). Primary 
antibodies to detect ligands were prepared in 3% bovine serum albumin in LBB and 
incubated for 1h at room temperature. Antibody solutions were aspirated and 
washed before adding horseradish peroxidase conjugated secondary antibody 
(diluted in 3% bovine serum albumin in LBB) and incubating for 30 min at room 
temperature. After washing as before, sample reactions were developed using 
100 pl pre-warmed 1-Step Ultra TMB-ELISA solution (Thermo Scientific) and 
stopped with 100 pl 2 M sulphuric acid. Absorbance was read at 450 nm, using a 
BioTek Synergy 4 microplate reader. For each ligand concentration, non-o-DG 
ligand binding was determined by blocking any LARGE-glycan bound to the plate 
with IITH6 antibody. The absorbance value from ITH6-blocked samples was sub- 
tracted from those of non-blocked samples to determine the specific binding 
activity of a-DG. Data from triplicate samples were assessed and fitted to a 
ligand-binding curve using SigmaPlot software. IIH6 ELISAs were completed 
similarly. 

Tissue-specific a-DG ligand binding. Brains (cerebrum), hearts and quadriceps 
were collected from nine C57BL6/J male mice (9-10 weeks of age) and rinsed with 
ice-cold PBS. Samples were homogenized by Brinkmann Plytron (cardiac and 
skeletal muscle) or Wheaton Overhead Stirrer (brain) in 7.5 volumes of 20 mM 
sodium pyrophosphate, 20mM sodium phosphate monobasic, 1mM MgCh, 
0.303 M sucrose, 0.5 M EDTA, pH 7.0. Samples were spun (15 min, 14,000g, 4 °C) 
and the supernatant was passed through cheesecloth to remove non-homogenized 
material. Total microsomes were obtained from the supernatant through ultra- 
centrifugation (37 min, 142,000g, 4°C) and the microsomes were washed twice 
with 0.303 M sucrose, 0.6 M KCI, 20 mM Tris-maleate, pH 7.0 (recollecting micro- 
somes by repeating ultra-centrifugation each wash) to remove loosely associated 
proteins. Total KCl washed microsomes were re-suspended in 0.303 M sucrose, 
20mM Tris-maleate, pH 7.0 and solubilized with 1% Triton X-100, 0.1% SDS. 
Non-solubilized material was removed by centrifugation (30 min, 30,000g, 4 °C) 
and the solubilized proteins (supernatant) were enriched for glycoproteins by 
WGaA-agarose as described above. Equivalent loading for solid-phase binding 
assays was determined by preparing dilution curves and performing a S-DG 
ELISA assay. Solid-phase assays were performed as described above. 
Glycosidase treatment. Treatment with glycosidases was as described previously’, 
with sialidase (Prozyme) added to the Enzymatic Protein Deglycosylation kit 
(Sigma-Aldrich) containing: PNGaseF, O-glycosidase, -N-acetylglucosaminidase, 


B(1-4)-galactosidase. Glycosidase treatment was carried out on WGA-enriched 
total cell lysates after the native enzymes were heat inactivated (5 min incubation at 
94 °C). Control (non-glycosidase treated) samples were treated identically without 
adding glycosidases. 

AChR aggregation. The AChR aggregation assay was performed on DD4 myo- 
tubes differentiated on Matrigel-coated glass cover slides. Twelve hours before 
fixation, agrin (R&D Systems, 200 pM) and/or laminin 111 (7.5nM) was added 
to the differentiation medium, to stimulate AChR aggregation. Samples were 
incubated for 30min with Alexa-488-conjugated «-bungarotoxin (Invitrogen, 
1:500), washed twice with cytoskeleton stabilizing buffer” (CSB) and fixed with 
2% paraformaldehyde in CSB for 20 min at room temperature. Samples were 
blocked with 10% FBS, 0.3% Triton X-100 in PBS for 10min and incubated 
overnight with ITH6 (1:50) and additional Alexa-488-conjugated «-bungarotoxin 
(1:1,000). Samples were imaged by confocal microscopy. Maximal intensity pro- 
jections were obtained from confocal z-stacks and quantified using ImageJ soft- 
ware (version 1.45s). 

Transmission electron microscopy. Tibialis anterior muscles were isolated from 
mice 10 days after CTX injection, after cardiac perfusion with PBS, and were 
treated with 4% paraformaldehyde in PBS. Samples were fixed in 2.5% glutaral- 
dehyde, 2% paraformaldehyde, 1% tannic acid in 0.1M Na cacodylate buffer 
(pH 7.3) for 1h at 4°C. They were then washed in 0.1 M Na cacodylate buffer 
and post-fixed with 1% osmium tetroxide for 1.5h at 4°C. After serial alcohol 
dehydration (50%, 75%, 95% and 100%), the tissue samples were embedded in 
Epon 12 (Ted Pella). Ultramicrotomy was performed, and ultrathin sections (65 nm) 
were post-stained with uranyl acetate and lead citrate. Samples were examined and 
imaged using a JEOL 1230 transmission electron microscope. Regions of interest, 
which had undergone regeneration due to CTX damage, were identified based on 
the presence of centrally localized nuclei and/or an accumulation of mitochondria 
at the sarcolemma. 

Antibodies. The following antibodies have been described previously and were 
obtained from the listed sources: IIH6** monoclonal antibody (Campbell laboratory); 
monoclonal §-DG antibodies 8D5*°, MANDA G2 7D11*° (Campbell laboratory); 
polyclonal anti-laminin (L9393) (Sigma-Aldrich); anti-Flag (Sigma-Aldrich); 
laminin-2 («2 chain) 4H8-2 (Enzo Life Sciences); perlecan Ab-1 (NeoMarker); 
DHPR rabbit polyclonal antibody" to «2 subunit (Campbell laboratory); affinity 
purified B-DG rabbit polyclonal AP83** (Campbell laboratory); embryonic myosin 
heavy chain developed by H. Blau (F1.653, obtained from the Developmental 
Studies Hybridoma Bank under the auspices of the NICHD and maintained by 
The University of lowa, Department of Biology); core %-DG antibody, RbtG6317” 
(Campbell laboratory); anti-collagen type VI (70R-CRO009x, Fitzgerald Industries 
International); anti-f-actin (clone AC-74, Sigma-Aldrich); and anti-Flag antibody 
(F7425, Sigma-Aldrich). 
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Extended Data Figure 1 | Generation of Large” mice. 

a, GT(ROSA)26Sor"™407 HetO-RNA‘Large (1 argeX?) targeted mice were 
generated at TaconicArtemis GmbH, using an engineered recombinase- 
mediated exchange acceptor site in the Rosa26 gene, and an exchange vector 
with the following elements: a neomycin resistance marker; an H1-Tet-On 
promoter-driven shRNA that targets Large; and a loxP-flanked CAGGS 
promoter-driven Tet repressor. Mouse embryonic stem cells with the above- 
described Rosa26 gene were exposed to six distinct doxycycline-inducible/ 
conditional shRNA constructs that target Large. After validation for correct 
targeting by Southern blotting, the ES cells were evaluated for efficient Large 
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GGAAGTGCGTGTTCGAGAAG ACTTCTCGAACACGCACTTCC 3 
CCTTCACGCACAAGCTCTTCA GAAGAGCTTGTGCGTGAAG 5 
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knockdown by qPCR, and for functional loss of IIH6-positive «-DG by western 
blotting. One knockdown embryonic stem (ES) cell line (targeting the 
sequence: GGAAGTGCGTGTTCGAGAAGT) was selected for use in 
blastocyst injections to generate chimaeric mice, on the basis of the efficiency of 
Large knockdown efficiency therein. Germline animals heterozygous for the 
knockdown cassette were backcrossed onto a C57BL6/J background. b, shRNA 
sequence used to target Large. c, d, Large” mice and littermate controls. In the 
absence of doxycycline, the animals are indistinguishable from control 
littermates both phenotypically (c) and biochemically (d). 
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Extended Data Figure 2 | Reduction in molecular mass of a-DG during and 
after regeneration and increased adipose staining in Large” muscle. 

a, b, Western blot analysis of WGA-enriched Triton X-100-soluble 
homogenates from the tibialis anterior muscles of Large“? mice 21 (a) and 5 
(b) days after CTX injury. Glycosylation of «-DG is ‘re-set’ to a lower molecular 
mass in the Large“? (dox+) mice. At 21 days after CTX injury, the contralateral 
(CL), uninjured Large? (dox+) muscle exhibits reduced IIH6 intensity 
relative to control (dox—), although the molecular mass in the two samples is 
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equivalent. In the 5-day samples, this reduction is not observed, a finding that 
indicates the loss of reactivity in the 21-day samples is attributable to loss of pre- 
existing o-DG and failure to replace with I[H6-positive «-DG owing to Large 
knockdown. At 5 days after injury, a marked increase in molecular mass 
variation is observed in CTX control tissue (similar to Fig. 3a). c, Haematoxylin 
and eosin (H&E) analysis reveals evidence of intramuscular adipose cells in 
Large®? gastrocnemius muscle 10 days after CTX. d, Staining with the 
lipophilic dye Oil Red O, confirming that fat is present. Scale bars, 50 um. 
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Extended Data Figure 3 | Basement membrane abnormalities in Large“? 
muscle. Large“? muscle at 10 days after CTX. a, Collagen VI and perlecan co- 
labelling of muscle sections. Arrowheads, lack of perlecan (normal) in 
endomysial space; arrow, perlecan in the endomysium. b, Secondary only 
control experiment demonstrating specificity of perlecan signal. c, Line scan 
intensity analysis for perlecan staining across adjacent myofibres, conducted by 
a blinded third-party. In the example shown (from a control muscle), increased 
sarcolemmal/basement membrane staining intensity is observed for the two 
adjacent myofibres, with minimal staining in the intervening endomysial space. 
Results of this analysis are quantified in d, e. For perlecan analysis eight line 
scans were conducted on at least three, X20 images from 10-day post CTX 
injury muscles (n = 4 CTX injured mice per group). One-way ANOVA on 
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ranks was used to assess the data. For d and e, **P < 0.001, error bars indicate 
s.e.m. f, Collagen VI, which is expressed during muscle regeneration, localizes 
to both the basement membrane and the endomysium in control and Large*” 
muscle 10 days after CTX. Laminin «2, which is normally restricted to the 
basement membrane in control muscle, is also observed in the endomysium of 
Large®” muscle (arrows). Arrowhead points to endomysium in control muscle. 
All images in a, b and f were collected with confocal microscopy, scale bars, 
50 um. g, Quantification of the increase in basement membrane thickness on 
the basis of transmission electron microscopy of muscle 10 days after CTX 
injury. Over six images from two Large“? and three control animals were 
assessed by a blinded observer using Image] software. One-way ANOVA was 
used to assess the data; **P = 0.001, error bars indicate s.e.m. 
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Extended Data Figure 4 | Expression analysis of Large” muscles repeats. The expression of all genes was compared to that for the reference gene 


undergoing regeneration. RT-PCR-based comparisons of gene expression in Rpl4. Two-way ANOVA was used to analyse the data, and post-hoc 

CTX and CL tibialis anterior muscles 10 days after injury, for Large (a), Dag] comparisons between relevant groups are depicted; *P < 0.05, **P < 0.001, 
(b), collagen VIo (Col6a) (c), laminin «2 (Lamaz2) (d), laminin B1 (Lamb1) NS, not significantly different; error bars indicate s.e.m. 

(e), perlecan (f) and agrin (g). n = 3 biological repeats and n = 3 technical 
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Extended Data Figure 5 | Muscle and basement membrane recovery from 
CTX injury and cellular hallmarks of C2C12 myogenesis. a, Serial sections of 
CTX-injected gastrocnemius muscle stained with haematoxylin and eosin 
(H&E) and with laminin and collagen antibodies. Existing laminin-containing 
basement membranes from degenerated myofibres are observable 4 days after 
CTX treatment. In areas of injury to myofibres (determined by central 
nucleation of myofibres), laminin and collagen VI are increased in intensity and 
appear more diffuse than in nearby uninjured regions. Dotted lines demarcate 
boundaries of damaged regions. Scale bars, 50 im. CL, contralateral. 
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b, Confluent myoblast cultures stimulated to differentiate by switch to low- 
serum medium (viewed by phase contrast). Differentiation day (DD) 0 is 
defined as when C2C12 myoblasts reach confluence in growth medium. 24 h 
after switching to serum-restricted medium (DD1), the myoblasts are 
elongated and begin to align with one another. By DD2, myoblast fusion is 
apparent, and steady myotube growth is observed over the next several days. 
Scale bars, 100 um. c, Representative densitometry-based quantification of 
IIH6-postitive “-DG on DDO (blue line) and DD5 (black). 
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Extended Data Figure 6 | Expression of a-DG-related genes during C2C12 
myogenesis. qRT-PCR analysis for Pomt1 (a), Pomt2 (b), Pomgnt1 (c), fukutin 
(d), Ispd1 (e), Tmems5 (f), Sgk196 (g), Ignt1 (h) and «7 integrin (Itga7, i) over the 
course of C2C12 myogenesis. Although significant changes in expression were 
observed for some genes, the pattern of expression did not match the increase in 
molecular mass observed in «-DG, or the increase observed in the expression of 


1 2 
Differentiation day 


Differentiation day 


Large and Dag!1. Biological and technical samples were assessed in triplicate. 
One-way ANOVA P values are included where significance was found, and 
time points with significantly increased expression (as determined by post-hoc 
analysis) are denoted with asterisks. *P < 0.05, NS, not significantly different; 
error bars indicate s.e.m. 
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Extended Data Figure 7 | Mass spectrometry and binding characteristics of 
in vitro generated LARGE-glycan repeats and associated binding 
characteristics of Large“? a-DG. a, Mass spectrometry analysis of high- and 
low-molecular-mass LARGE-glycan repeats using MALDI-TOF MS. High- 
molecular-mass species are under-represented owing to the difficulty in 
detecting polysaccharides of greater than 10 kDa using this approach’. b, ITH6 
antibody binding to low- and high-molecular-mass LARGE-glycan repeats 
(error bars indicate s.e.m.). c, Solid-phase binding assay for the perlecan V 
domain (error bars indicate s.e.m.). d, Dystroglycan in DD4 myotubes 
generated from isolated primary Large“? (+ 1 1g ml! doxycycline) and 
control satellite cells was assessed for substrate binding. Solid-phase «1LG4-5 
binding demonstrated that binding capacity is dependent on extension of the 
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LARGE-glycan (control Kg = 15.0 + 5.20nM; Large“? Ky = 20.6 + 3.15 nM). 
e, Findings for the perlecan V domain were similar to those of 71 LG4-5 (control 
Kq = 4.37 + 0.749 nM; Large“? Ky = 4.88 + 0.846 nM). f, Laminin 111 self 
assembles, and we wished to test the binding characteristics of a form that is 
unable to do so. AEBSF-laminin 111 is unable to polymerize yet retains receptor 
binding capacity** and when it was tested the differences in binding between 
Large“? and control samples were greater than those for laminin 111. This 
finding indicates that the polymerization of laminin bound to LARGE-glycan 
reduces the apparent differences in binding between Large“” and control %-DG 
(AEBSF-laminin 111 binding control Kg = 7.61 + 2.12nM, Bmax = 1.145 
LargeX? Kg = 17.1 + 7.36nM, Bmax = 0.268). Error bars indicate s.e.m., curve 
fitting to equation f = Byyax*abs(x)/(Ka + abs(x)). 
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Extended Data Figure 8 | Overexpression of LARGE in a transgenic mouse 
results in abnormal muscle regeneration. a, Map of transgene used to 
generate the CAG-Large transgenic mouse. b, Western blot of «-DG from CL 
and CTX injured tibialis anterior muscles 4 days after injury, demonstrating 
that the protein is hyperglycosylated in CAG-Large muscles. c, WGA-enriched 
samples from CAG-Large muscles 10 days after CTX injury were treated with 
enzymatic de-glycosylation to remove N-glycans, certain a-DG mucin 
O-glycans and terminal trisaccharides of O-mannosy] tetrasaccharides while 
sparing the phosphorylated O-mannosyl-linked LARGE-glycan as evidenced 
by the maintenance of IIH6-positive %-DG and reduction in molecular mass of 


B-DG and the IgG present in the samples. d, No change in laminin 111 binding, 
as detected by solid-phase assay, was observed in glycosidase-treated CAG- 
Large samples (error bars indicate s.e.m., n = 4 biological replicates) compared 
to untreated samples indicating that despite Large overexpression, nonspecific 
LARGE modification is minimal in these muscles. e, f, Haematoxylin-and- 
eosin-based histological analysis of muscles 10 days after CTX injury revealed 
an increase in myofibres of small diameter, quantified in g-i. Scale bars 
represent 200 jim. >6,000 fibres per group were measured, n = 3 animals per 
group, *P = 0.006, **P < 0.001 by one-way ANOVA; error bars indicate s.e.m. 
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Extended Data Figure 9 | Increased IIH6 reactivity in regenerating 
myofibres from patients. a, Western blots of WGA-enriched muscle lysates 
from patient biopsies. CMD, congenital muscular dystrophy; LGMD2I, limb- 
girdle muscular dystrophy 21. b, Patient biopsies were analysed by 
immunofluorescence for the presence of the «%-DG LARGE-glycan (using 
antibody ITH6) and for signs of regeneration (using antibody to the embryonic 
myosin heavy chain, eMHC). Increased IIH6 signal correlates with regions of 
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muscle regeneration. In the LGMD2I sample, ITH6 signal is nearly exclusively 
localized to regions of regeneration. c, Representative immunofluorescence 
images from patient muscle biopsies. Insets highlight mislocalized 
(endomysial) perlecan in CMD sample. Intense, localized capillary staining is 
also observed with perlecan antibody. d, Secondary only staining for perlecan in 
human dystrophic muscle sample, demonstrating that the signal in c is specific. 
Scale bars, 100 ptm. 
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Extended Data Figure 10 | Model depicting the role of LARGE-glycan in 
organizing specialized basement membranes. a, In normal skeletal muscle, 
LARGE-glycan on «-DG is abundant and forms long chains, providing many 
binding sites for ligands such as laminin, agrin and perlecan. In addition to 
binding the LARGE-glycan, perlecan and agrin bind (directly, or indirectly via 
accessory proteins like nidogen) to laminin and collagen networks, providing 
collateral linkages as was recently suggested in regards to perlecan**. Increased 
collateral linkage probably enables compaction of the basement membrane. 
b, In Large*” muscle and some dystroglycanopathies, reduced extension of the 
LARGE-glycan during muscle formation results in a reduction in the number 
of binding sites for ligands, and the few collateral linkages between collagen, 
laminin and the sarcolemma are unable to compress the basement membrane. 
Also, because the basement membrane anchors less collagen, this protein 
accumulates in the endomysium where it provides ectopic binding sites for 
perlecan. Agrin is both a high-affinity ligand for the LARGE-glycan and highly 
concentrated at the NMJ. When the number of LARGE-glycan binding sites at 
the NMJ basement membrane is limited, agrin saturation of the NMJ may limit 
the laminin-mediated maturation of the NMJ**. Although DG is depicted as 
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having only two LARGE-glycan chains in this model, it is likely that between 2 
and 5 chains are present per DG molecule in skeletal muscle’. Integrins, which 
are not depicted here, can also bind laminin, and thus could contribute to this 
network; however, mouse studies indicate that its cellular roles are largely 
independent of DG”. c, Representative western blot analysis of x-DG in various 
mouse tissues, demonstrating extensive heterogeneity in molecular mass. 
Tissues were collected from C57BL6/J mice and samples from multiple mice 
were pooled before homogenization and Triton X-100 solubilization. Equal 
amounts of lysate, enriched for glycoprotein by WGA pull down, were loaded. 
Laminin 111 overlay assay demonstrated that despite differences in molecular 
mass, the -DG glycoforms in various tissues retain the capacity to bind 
laminin. Cb, cerebrum; Cl, cerebellum; E, eye (globe); H, heart; K, kidney; L, 
lung; n.l., native laminin; SkM, skeletal muscle; T, thymus. d, However, higher 
molecular mass o-DG species have increased binding capacity for laminin 111, 
as demonstrated by solid-phase assay, indicating that tissue-specific 
modification of «-DG LARGE-glycan levels is a possible means to modify cell 
interactions with the extracellular matrix environment. Error bars indicate 
s.e.m., curve fitting to equation f = B,,a.*abs(x)/(Kg + abs(x)). 
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Structural basis for action by diverse antidepressants 
on biogenic amine transporters 


Hui Wang’, April Goehring', Kevin H. Wang’, Aravind Penmatsa', Ryan Ressler! & Eric Gouaux!* 


The biogenic amine transporters (BATs) regulate endogenous neu- 
rotransmitter concentrations and are targets for a broad range of 
therapeutic agents including selective serotonin reuptake inhibi- 
tors (SSRIs), serotonin-noradrenaline reuptake inhibitors (SNRIs) 
and tricyclic antidepressants (TCAs)'*. Because eukaryotic BATs 
are recalcitrant to crystallographic analysis, our understanding of 
the mechanism of these inhibitors and antidepressants is limited. 
LeuT is a bacterial homologue of BATs and has proven to be a valu- 
able paradigm for understanding relationships between their struc- 
ture and function’. However, because only approximately 25% of 
the amino acid sequence of LeuT is in common with that of BATs, 
and as LeuT is a promiscuous amino acid transporter’, it does not 
recapitulate the pharmacological properties of BATs. Indeed, SSRIs 
and TCAs bind in the extracellular vestibule of LeuT*”’ and act as 
non-competitive inhibitors of transport®. By contrast, multiple 
studies demonstrate that both TCAs and SSRIs are competitive inhi- 
bitors for eukaryotic BATs and bind to the primary binding 
pocket*°. Here we engineered LeuT to harbour human BAT-like 
pharmacology by mutating key residues around the primary binding 
pocket. The final LeuBAT mutant binds the SSRI sertraline with a 
binding constant of 18 nM and displays high-affinity binding to a 
range of SSRIs, SNRIs and a TCA. We determined 12 crystal struc- 
tures of LeuBAT in complex with four classes of antidepressants. 
The chemically diverse inhibitors have a remarkably similar mode 
of binding in which they straddle transmembrane helix (TM) 3, 
wedge between TM3/TM8 and TM1/TM6, and lock the transporter 
in a sodium- and chloride-bound outward-facing open conforma- 
tion. Together, these studies define common and simple principles 
for the action of SSRIs, SNRIs and TCAs on BATs. 

We used the structure of wild-type LeuT in complex with the compe- 
titive inhibitor tryptophan (PDB code 3F3A)* as a template for mutant 
design (Fig. 1a). We analysed residues within a 10A radius of the 
primary binding pocket of the LeuT-Trp complex (Fig. 1a) together 
with a LeuT/human serotonin transporter (SERT) amino acid sequence 
alignment to identify about 20 residues which point towards the prim- 
ary binding pocket and are divergent from SERT (Supplementary Fig. 1). 
These residues are located in both bundle and scaffold domains’’, 
sodium binding sites’, the chloride binding site’*”’ and the extracellu- 
lar vestibule. Previous studies have demonstrated the importance of 
many of these residues in SERT pharmacology”'*!°”°"". By tracking 
the binding constant (Kg) of [°H] paroxetine, we introduced these 
mutations into LeuT, focusing initially on ‘first shell’ residues predicted 
to interact directly with inhibitors and next on ‘second shell’ residues 
(Supplementary Table 1). The Kg values for paroxetine and mazindol 
binding to the final LeuBAT mutant, deemed A13 LeuBAT (Sup- 
plementary Table 1), are 431 + 24nM and 112 + 18 nM, respectively 
(Supplementary Fig. 2). Notably, the Ky of A13 for mazindol is similar 
to that of SERT (103 + 4.7 nM)”. Because uptake experiments using the 
A6 or A13 variants reconstituted into liposomes show that the constructs 
are not active in transporting either serotonin or dopamine (Supplemen- 
tary Fig. 3), further experiments would be required to engineer a variant 


of LeuBAT that possesses both high-affinity inhibitor binding and 
transport activity. 

For the A13 LeuBAT construct we performed competition experi- 
ments using [*H] paroxetine and multiple cold SSRIs, SNRIs anda TCA 
(Fig. lb-d and Supplementary Table 2). Notably, sertraline possesses 
the highest affinity (Ky = 18 + 2nM; K, = 14 + 2 nM; Fig. Ic, d), thus 
approaching the reported value for sertraline binding to SERT (0.3 nM)”. 
To demonstrate that the A6é and A13 variants possess increased affini- 
ties for inhibitors relative to wild-type LeuT, we determined the Kg 
values for sertraline and mazindol binding to wild-type LeuT to be 
308 + 63 nM and 22.3 + 5.4 |M, respectively, whereas the binding of 
paroxetine could not be fit to an isotherm because of low affinity (Sup- 
plementary Fig. 2). The substrate alanine, which binds to the primary 
pocket of wild-type LeuT’, could not suppress the binding of sertraline 
to wild-type LeuT (Supplementary Fig. 2h), consistent with the con- 
clusion that these drugs bind within the extracellular vestibule of wild- 
type LeuT*”. 

We determined crystal structures of LeuBAT in complex with a 
panel of SSRIs, SNRIs and a TCA using the A5, A6é and A13 variants 
(Supplementary Table 3). For the A5 and A6é mutants, we determined 
structures for the A5-mazindol, A6é-sertraline, A6—desvenlafaxine, A6— 
duloxetine and A6é-mazindol complexes at resolutions of 2.3-2.7 A. For 
the A13 variant, we determined seven structures with sertraline, par- 
oxetine, fluoxetine, fluvoxamine, duloxetine, desvenlafaxine and clomi- 
pramine (CMI) at resolutions of 2.85-3.31 A (Supplementary Fig. 4 
and Supplementary Table 3). Because the binding positions of inhibi- 
tors is similar between the A6é and A13 constructs (Supplementary Fig. 
5), we used the higher resolution structures of the A6 complexes for 
analysis and validation of the drug-binding sites in A13. 

All LeuBAT structures adopt an outward-facing open conformation 
(Figs 2a and 3), similar to that of wild-type LeuT in complex with 
tryptophan* (root mean squared deviation (r.m.s.d.) of 0.48 A for Co 
atoms). All drugs bind to the primary binding pocket’, interact with 
both the bundle (TM1, TM6) and scaffold (TM3, TM8) domains, and 
are lodged between the extracellular gate residues Arg 30 and Asp 404 
(ref. 3). These observations invalidate the notion that SSRIs and TCAs 
elicit their effects on SERT by binding in the extracellular vestibule*””’. 

We find two sodium ions, Nal and Na2, bound to sites similar to 
those in the LeuT-Trp complex with the following distinction. In the 
wild-type LeuT structures** the o-carboxyl group of Leu or Trp parti- 
cipates in the coordination of Nal, whereas in the LeuBAT structures a 
water molecule is found at the equivalent position, bridging the car- 
boxylate of Asp24 and Nal (Supplementary Fig. 6). We detected 
electron density for a chloride ion in the LeuBAT-paroxetine structure 
~4.5A from Nal, coordinated by Tyr47, Ser254, Asn286 and 
Ser 290, as predicted previously (Supplementary Fig. 6)'°?7*. 

We solved structures of four LeuBAT-SSRI complexes: sertraline, 
paroxetine, fluoxetine and fluvoxamine. Sertraline occupies the primary 
pocket surrounded by TM1, TM3, TM6 and TM8 and buries 438.6 A 
or 93%, of its surface area (Fig. 2). The amine group forms a salt bridge 
with the carboxyl group of Asp 24 whereas the tetrahydronaphthalene 
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Figure 1 | LeuBAT design and pharmacology. a, The representation of 
mutation positions around the primary binding pocket in the wild-type 
LeuT-Trp structure (PDB 3F3A). Bound tryptophan (yellow) and the mutated 
residues are in sticks. The transmembrane helices TM1, TM3, TM6, TM8 and 
TM10 around the pocket are highlighted as green, red, purple, orange and blue, 
respectively. Asterisks depict the glycine residue positions. b, Chemical 
structures of four SSRIs, two SNRIs, one TCA (CMI) and one stimulant 
(mazindol). c, Measurement of [*H]sertraline binding (filled circles) to A13 
LeuBAT. d, Dose-response curves for inhibition of [*H]paroxetine binding to 
A13 LeuBAT by sertraline (filled diamonds), fluvoxamine (empty circles), 
fluoxetine (empty diamonds), duloxetine (empty inverted triangles), CMI 
(empty triangles) and desvenlafaxine (empty squares). Error bars, s.e.m, n = 3. 


ring participates in hydrophobic interactions with Tyr 21 and is sand- 
wiched between Val 104, Tyr 108 and Phe 259 (Fig. 2). The two chlorine 
atoms on the dichlorophenyl ring insert into a groove formed by 
Pro 101, Val 104, Ala 105, Ser 356 and Gly 359. We suggest that these 
extensive hydrophobic and van der Waals interactions contribute to 
the high affinity of sertraline for LeuBAT and human SERT. 
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Figure 2 | LeuBAT A13-sertraline complex adopts an outward-facing open 
conformation. a, Cross-sections of the crystal structure of A13-sertraline 
showing the solvent-accessible surface area (blue). Bound sertraline is shown in 
yellow sticks. b, c, Zoom into the sertraline binding pocket viewed within the 
membrane plane (b) and from the extracellular side (c). Sertraline and key 
residues in the pocket are depicted in both sticks and spheres showing the van 
der Waals radii. Salt bridges and hydrogen bonds are in dashed lines. Phe 259 in 
b is omitted for clarity. 


The other SSRIs—paroxetine, fluoxetine and fluvoxamine—also 
bind to the primary binding pocket in an orientation similar to that 
of sertraline. Consistent with previous SERT-paroxetine and SERT- 
fluoxetine models’’, the amine groups are proximal to the carboxyl 
group of Asp 24 (Fig. 3a, b). The amine groups of the inhibitors also 
form direct hydrogen bonds with main-chain carbonyl groups of 
Tyr 21, Ala22 and/or Phe 253. The benzodioxol group of paroxetine 
and trifluoromethylphenyl rings from fluoxetine and fluvoxamine 
insert to the same groove, as does the chlorophenyl ring of sertraline, 
forming hydrophobic interactions with Val 104, Tyr 108 and Phe 259, 
and/or van der Waals interactions with the main-chain carbonyl 
groups of Pro 101, Ala 105, Ser 356 and Gly 359. In addition, the fluor- 
ophenyl ring of paroxetine, the phenyl ring of fluoxetine and the ether 
chain of fluvoxamine extend into the extracellular vestibule, forming 
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Figure 3 | SSRIs, SNRIs, TCA and mazindol share similar binding features. 
a, Paroxetine binding site in the A13 LeuBAT mutant, viewed within the 
membrane plane with paroxetine shown as pink sticks. Sodium ions and key 
residues are shown as black spheres and orange sticks, respectively. Hydrogen 
bonds, salt bridges and polar interactions are in dashed lines. 

b, Superimposition of paroxetine (pink), (R)-fluoxetine (green), fluvoxamine 
(blue) and sertraline (yellow) in the primary binding pocket of the A13 
LeuBAT. c, (S)-duloxetine binding site in the A13 LeuBAT with duloxetine 
shown as cyan sticks. d, Superimposition of (S)-duloxetine (cyan) and 
(S)-desvenlafaxine (magenta) in the primary drug-binding pocket. e, CMI 
binding site in A13 LeuBAT with CMI shown as olive sticks. f, Mazindol 
binding site in the LeuBAT A6 variant with mazindol molecule shown as sand- 
coloured sticks. 


hydrophobic interactions, van der Waals contacts and hydrogen-bonding 
interactions with Tyr 107, Phe 253, Asp 404 and/or Thr 408. 

In the SNRI complexes with duloxetine and desvenlafaxine, the 
inhibitors sit in the primary pocket with the amine groups interacting 
with the carboxyl group of Asp 24 and, in the case of duloxetine, also 
with the main-chain carbonyl of Tyr 21 (Fig. 3c, d). The naphthalene 
ring from duloxetine and the cyclohexanol ring from desvenlafaxine 
are sandwiched by hydrophobic groups Tyr 21, Val 104, Phe 259 and 
Tyr 108. The thiophene ring from duloxetine and the phenol ring from 
desvenlafaxine protrude into the extracellular vestibule and interact 
with Phe 253 by edge-to-face interactions. For desvenlafaxine, the 
hydroxyl group in the cyclohexanol moiety makes a hydrogen bond 
with the main-chain carbonyl group of Ser 355, whereas the phenol 
hydroxyl interacts with Asp 404 and the phenol group of Tyr 107. 

The tricyclic antidepressant CMI binds to the primary binding pocket 
of A13 LeuBAT (Fig. 3e), in agreement with a human SERT-TCA 
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model'"”” and in contrast to wild-type LeuT-TCA structures*®. The 
tricyclic ring is surrounded by hydrophobic residues including Tyr 21, 
Val 104, Tyr 108, Phe253 and Phe259. The chlorine atom in the 
tricyclic ring extends to the pocket formed by Ala 105, Ser 356 and 
Gly 359, similar to the chorine positions in sertraline or the trifluor- 
omethyl moiety in fluoxetine. Not only are previously proposed inter- 
actions” observed in our LeuBAT-CMI structure, such as the salt 
bridge between Asp 24 (Asp 98 in SERT) and the tertiary aliphatic 
amine of the TCA, the structure is consistent with interactions 
between Ala 105 (Ala 173 in SERT) and the TCA 3-position, and with 
Phe 253 (Phe335 in SERT) being near the TCA 7-position. Our 
structure is in harmony with the conclusion that Ser 438 in human 
SERT (Ser 355 in LeuBAT) is vicinal to the aminopropyl chain of 
TCA and that the S438T mutation affects the binding affinity because 
of steric clash"’. 

The stimulant mazindol binds to the primary site and is surrounded 
by a hydrophobic pocket formed by Tyr 21, Val104, Tyr 108 and 
Phe 259 (Fig. 3f). The amine nitrogen forms a salt bridge with the 
carboxyl group of Asp 24. The hydroxyl group hydrogen bonds with 
the carboxyl group of Asp 24 and the phenol group of Tyr 108. The 
chloropheny] ring inserts into the pocket formed by Ala 105, Ser 356 
and Gly 359, similar to sertraline. The importance of these interactions 
is supported by the fact that removal of the chlorophenyl ring or 
changes in the substitution on the phenyl ring decreases the affinity 
of mazindol to SERT’. 

By soaking A6 mutant crystals in 20 mM desvenlafaxine, we iden- 
tified a second desvenlafaxine molecule in the extracellular vestibule 
(Supplementary Fig. 7). The second molecule occupies a similar posi- 
tion to the positions that TCAs, sertraline and fluoxetine occupy in 
wild-type LeuT structures reported previously*’ (Supplementary Fig. 7). 
These results demonstrate that the n-octyl-B-p-glucoside molecule 
bound to the extracellular vestibule of wild-type LeuT”*”® is readily 
substituted by a drug molecule and that the extracellular vestibule is 
a site for the low-affinity, non-specific binding of small molecules. 

Analysis of the sertraline (SSRI), duloxetine (SNRI) and CMI (TCA) 
complexes with LeuBAT allows us to identify three subsites within the 
primary binding site to which the pharmacophores of these chemically 
diverse inhibitors bind (Fig. 4a, b). Subsite A is defined by Asp 24, 
Tyr 21, Gly 256 and Ser 355 from TM1, TM6 and TM8, and accom- 
modates the polar, amine moiety of the inhibitors”’. Asp 24 interacts 
with the amine groups of most drugs by salt bridge, which echoes the 
suggestion that Asp 98 in human SERT forms a similar interaction 
with the amine groups of SSRIs, TCAs and serotonin?’**7°77, 

Subsite B includes residues from TM3, TM6 and TMB and involves 
two types of interactions. First, nonpolar residues form hydrophobic 
interactions with the hydrophobic rings of the drugs. Phe 259 (Phe 341 
in human SERT) together with Val 104 (Ile 172 in human SERT or 
Val 148 in the human noradrenaline transporter (NET)) define a non- 
polar ridge that accommodates the hydrophobic groups of the drugs’””*. 
Previous studies showed that the F341Y mutation in human SERT 
reduces the potency of paroxetine and escitalopram’’. Here we suggest 
that the F341Y mutation leads to a clash with the fluorophenyl ring of 
paroxetine. The second type of interaction in subsite B is the groove 
delineated by Pro101, Ala105, Gly359 and Ser 356. This groove 
accommodates the polar groups in the drugs’ rings, such as the chloro, 
dichloro, trifluoromethyl and benzodioxol groups of CMI, sertraline, 
fluoxetine and paroxetine, respectively, and is capped by Tyr 108. A 
SERT-imipramine model suggests that the imipramine 3-position is 
vicinal to Ala 173 (Ala 105 in LeuBAT)”, and here we show that the 
chlorine atom in the 3-position of CMI forms a direct contact with this 
ridge. 

Subsite C, distal to the primary, orthosteric binding site, is located in 
the extracellular vestibule, and is comprised of residues in TM6 and 
TM10, including Phe 253, Asp 404 and Thr 408. Subsite C interacts 
with bulky drugs such as paroxetine, desvenlafaxine and fluoxetine, 
and probably has a role in enhancing inhibitor affinity and specificity. 
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Figure 4 | Implication for drug binding in human SERT and validation by 
mutational studies. a, Superposition of SSRI (sertraline, yellow), SNRI 
(duloxetine, cyan) and TCA (CMI, olive) in the primary binding pocket of 
LeuBAT, viewed from extracellular side. The key residues in the pockets and the 
two sodium ions are shown as sticks and black spheres, respectively. The 
regions enclosed by dashed lines define subsites A, B and C in the primary drug 
binding pocket. b, Schematic representation of drug interactions in the primary 
binding pocket of LeuBAT. The transmembrane helices are shown as cylinders. 
Residue numbering follows LeuBAT and SERT, respectively. c, Plot of 
sertraline binding constants for Y21A, D24E, F259Y and $355T mutants of the 
A13 LeuBAT against the inhibition constants for the corresponding mutants 
Y95A, D98E, F341Y and S438T in SERT, respectively. WT, wild type. Error 
bars, s.e.m, 1 = 3. 


Indeed, previous studies determined that mutation of E493Q in human 
SERT (Asp 404 in LeuBAT) attenuates the potency of fluoxetine’. 

To support the conclusion that the LeuBAT complexes represent 
reliable models for BAT-inhibitor complexes, we prepared and analysed 
the individual Y21A, D24E, F259Y and $355T mutations in the context 
of the A13 LeuBAT construct. Y21A, D24E and S355T are in subsite A 
and F259Y is in subsite B. We next measured their [*H]sertraline Ky 
values and plotted log Kg of the LeuBAT mutants against log K; of the 
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homologous human SERT mutants” (Fig. 4c). The resulting linear 
relationship suggests that the mutations in LeuBAT have similar effects 
to those in SERT, and suggests that LeuBAT represents a framework 
for understanding the pharmacology of human SERT. We further 
compared the pharmacological selectivity of LeuBAT with all human 
BATs by plotting log K, (LeuBAT) versus log K, (human dopamine 
transporter (DAT), NET or SERT) (Supplementary Fig. 8). Inspection 
of these plots suggests that the pharmacological properties of LeuBAT 
are a hybrid of human BATs. Moreover, we compared the TCA bind- 
ing site of the LeuBAT-CMI complex with the recently determined 
structure of the Drosophila DAT-nortriptyline complex” (Supplemen- 
tary Fig. 9). We note that the overall outward-open conformation and 
the essential elements of inhibitor binding are shared between the two 
structures. Finally, we reverted residues Asp 24, Gly 256 and Gly 359 in 
LeuBAT to their LeuT identities and investigated their effects on the 
drug binding. These mutations profoundly diminish drug binding 
(Supplementary Fig. 10), thus supporting the conclusion that the LeuBAT- 
inhibitor crystal structures represent specifically bound ligand-transporter 
complexes. 

Taken together, the LeuBAT complexes allow us to map crucial sub- 
sites within the primary, orthosteric binding site that are responsible 
for binding the pharmacophores of a chemically diverse group of 
SSRIs, SNRIs and TCAs (Supplementary Fig. 11); they show that these 
inhibitors act by binding to the outward-open conformation of the 
transporter; and, perhaps most importantly, they provide molecular 
guideposts for the development of new therapeutic agents. 


METHODS SUMMARY 


The LeuBAT mutants were expressed and purified as previously described? except 
that lauryl maltose neopentyl glycol was used for solubilization and purification. 
After final purification by size-exclusion chromatography in n-octyl-B-p-glucoside, 
LeuBAT was concentrated to 2.5 mg ml _' and supplemented with saturating sero- 
tonin or mazindol prior to crystallization. All LeuBAT-drug complexes except 
mazindol were formed by soaking LeuBAT crystals in crystallization solutions 
containing 3-20mM of each drug. The structures were solved by molecular 
replacement using the LeuT-Trp structure’ as a search probe and then subjected 
to crystallographic refinement. The functional activities of LeuBAT mutants were 
examined by (*H] paroxetine, [PH]sertraline and [*H]mazindol saturation and 
competition binding assays. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 


Protein expression and purification. The LeuBAT mutants were expressed and 
purified as described previously for wild-type LeuT*”° with two exceptions. First, 
lauryl maltose neopentyl glycol (MNG-3) instead of n-dodecyl-f-p-maltopyranoside 
(C,2M) was used for solubilization and purification. Second, no drugs or substrate 
were added during purification. Cell membranes were solubilized using a buffer 
composed of 25 mM Tris-HCl (pH 8.0), 250 mM NaCl and 1% MNG-3 and the 
protein was then purified by immobilized-metal affinity chromatography (IMAC) 
in the presence of 0.02% MNG-3. For crystallization, LeuBAT was purified by size- 
exclusion chromatography in 25 mM Tris-HCl (pH 8.0), 150 mM NaCl and 40 mM 
n-octyl-B-p-glucoside. 

Crystallization and crystal soaking. LeuBAT with saturated serotonin or mazindol 
was crystallized by vapour diffusion at 20°C with two different precipitating 
solutions containing: 100mM NaPi (pH7.0), 100mM NaCl, 32-34% PEG300; 
or 100mM glycine (pH 9.4), 0.1M LizSO4, 29-31% PEG400. LeuBAT crystals 
grown in the presence of serotonin were soaked for 1h in crystallization solution 
containing 4mM paroxetine, 20 mM desvenlafaxine, 3.7 mM sertraline, 3.7 mM 
fluoxetine, 3 mM duloxetine, 3 mM fluvoxamine or 10 mM CML. After soaking, 
crystals were directly flash frozen in liquid nitrogen before X-ray diffraction data 
collection. 

Data collection and structure elucidation. Diffraction data sets were collected at 
the Advanced Light Source (beamlines 5.0.2 and 8.2.1) and Advanced Photon 
Source (Argonne National Laboratory, beamlines 24-ID-C and 24-ID-E). Data 
sets were processed using HKL2000 software*® and iMOSFLM”'. The LeuBAT- 
antidepressant structures were determined by molecular replacement using LeuT-Trp 
(PDB code 3F3A) as a search probe using Phaser” in CCP4 suite’’. The drugs were 
built based on the omit maps that were calculated by Phenix”, with the antide- 
pressants omitted from the structure factor calculation. The refinements were 
carried out using Phenix and Refmac in CCP4 suite*’, and manual adjustments 
were made using Coot**. The structure quality analysis was carried out using 
Molprobity** with 97% of the residues in the most favoured regions and none in 
disallowed regions based on Ramachandran analysis. All structure figures were 
generated with PyMOL (DeLano Scientific). 

Binding and competition assays. For binding studies, protein from IMAC frac- 
tions was desalted by Zeba Desalt Spin Columns in buffer A (150 mM Tris-MES 
(pH7.5), 100mM NaCl, 20% (v/v) glycerol, 0.02% MNG-3). Scintillation prox- 
imity assays (SPA) were performed as before’’. Sertraline binding was performed 
by incubating 4nM protein with 1-2 mgml | copper yttrium silicate (Cu-Ysi) 
SPA beads in buffer A in the presence of 0.3-300 nM [*H]sertraline (4 Cimmol'). 
For mazindol binding, 25 nM protein was incubated with 1-2 mg ml * Cu-Ysi SPA 
beads in buffer A in the presence of 1-3,000 nM (°'H]mazindol (2.5 Cimmol ’). 
For paroxetine binding, 50-100 nM protein was incubated with 1-2 mg ml“! Cu-Ysi 
SPA beads in buffer A in the presence of 0.01-10 uM PH] paroxetine (1.04 Cimmol ~ 4, 
Nonspecific binding was measured in the presence of 400 mM imidazole, 0.1 mM 


mazindol and/or 0.5 mM CMI. Plate readings were taken using a Wallac Microbeta 
plate counter. Data after overnight incubation was analysed by GraphPad Prism4 
and fit into a single-site binding function. Experiments were performed three 
times. 

Competition assays were performed by using 300nM_ [*H]paroxetine 
(4.16 Cimmol ') in buffer A. For half-maximum inhibitory concentration mea- 
surements, the concentrations of inhibitors were varied from 3 nM to 6.7 mM. A 
protein concentration of 100 nM was used for the competition assays, except that 
10 nM protein was used for the sertraline competition experiments. The K; values 
were calculated by using the Cheng—Prusoff equation. 

To compare mazindol binding to the A13, A13(D24G), A13(G256S) and 

A13(G3591) mutants, proteins (50nM) were incubated with 1 mg ml — 1 Cu-Ysi 
SPA beads in buffer A in the presence of 100 nM (?H]mazindol (27.8 Cimmol '). 
For sertraline binding, proteins (27 nM) were incubated with 1 mg ml! Cu-Ysi 
SPA beads in buffer A in the presence of 15 nM [H]sertraline (80 Cimmol '). 
Nonspecific binding was measured in the presence of 0.1mM mazindol and 
sertraline, respectively. 
Fluorescence-detection size-exclusion chromatography. Fluorescence-detection 
size-exclusion chromatography (FSEC) was performed as described previously”*. 
Before the FSEC experiment, 1.2-4.8 pmol of each variant protein was incubated 
with 45-90 pmol of the 2B12 monoclonal antibody” for 1 h and then loaded onto a 
Superose 6 column pre-equilibrated with 20 mM Tris-HCl (pH 8.0), 150 mM NaCl 
and 0.02% MNG-3 detergent. The elution was monitored by an in-line fluorescence 
detector which was set with excitation at 280 nm and emission at 335 nm. The 
proteins in the absence of 2B12 were used as a control (Supplementary Fig. 10). 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/naturel2747 


Corrigendum: Exercise-induced 
BCL2-regulated autophagy is 
required for muscle glucose 
homeostasis 


Congcong He, Michael C. Bassik, Viviana Moresi, Kai Sun, 
Yongjie Wei, Zhongju Zou, Zhenyi An, Joy Loh, Jill Fisher, 
Qihua Sun, Stanley Korsmeyer, Milton Packer, Herman I. May, 
Joseph A. Hill, Herbert W. Virgin, Christopher Gilpin, 
Guanghua Xiao, Rhonda Bassel-Duby, Philipp E. Scherer 

& Beth Levine 


Nature 481, 511-515 (2012); doi:10.1038/nature10758 


In this Letter, information was inadvertently omitted that the results 
in Supplementary Fig. 21 represent the mean + s.e.m. for 4-5 mice 
per group. We also note that the samples used for Fig. 3f, Supplemen- 
tary Figs 3 and 13a were derived from the same experiment, and that 
the gels/blots were processed in parallel. Equal concentrations of protein 
were loaded on all gels. The same actin loading control gel is shown in 
Fig. 3f, Supplementary Figs 3 and 13a. Furthermore, owing to a pro- 
duction error at the proof stage, the time point labels were missing 
from Fig. le. After “Exercise (min)’, the labels should read 0, 15 and 30. 
This has now been corrected in the PDF and HTML versions of the 
paper. None of these changes alters the conclusions of the paper. 
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Retraction: Functional dissection of 
lysine deacetylases reveals that 
HDACI and p300 regulate AMPK 


Yu-yi Lin, Samara Kiihl, Yasir Suhail, Shang-Yun Liu, 
Yi-hsuan Chou, Zheng Kuang, Jin-ying Lu, Chin Ni Khor, 
Chi-Long Lin, Joel S. Bader, Rafael Irizarry & Jef D. Boeke 


Nature 482, 251-255 (2012); doi:10.1038/nature10804 


In response to a concern raised by a reader about inconsistencies in 
our Letter between the results from the primary microarray screen 
and cell growth validation studies (Supplementary Table 2), we reviewed 
the methods described, and the subsequent analytical and validation 
work. We conclude that the Methods section in our Letter is inaccurate, 
and that for 38% of the interactions found by the primary screen there 
was a discordance in sign when validated (for example, an interaction 
might be called ‘synthetic lethal’ by the primary screen but ‘synthetic 
rescue’ in the cell growth validation assay, or vice versa). Although the 
results from the reanalysis did not contradict our key conclusions, we 
also re-evaluated the experiments in Figs 2-4 that indicate that p300 
acetylates PRKAAI and HDACI deacetylates it. We sequence-verified 
and spot-checked many of the plasmids and cell lines described, which 
are available on request. The biological materials were found to be as 
we described, and certain aspects, such as p300 acetylation of PRKAA1 
protein in vitro, could be reproduced. However, despite several attempts, 
we were unable to obtain results definitively supporting the major 
claims of the Letter, namely that p300 is the acetyltransferase and 
HDAC1 is the deacetylase for PRKAA1 in cells. The batch of polyclo- 
nal antibody against acetyl-lysine originally used is no longer available. 
More definitive experiments require antibodies that specifically recog- 
nize acetylated sites on PRKAAI, which we are developing. Although 
our inability to reproduce these results does not mean our conclusions 
are incorrect, we cannot say with confidence that they are correct. Given 
the time that has elapsed, and our inability to reproduce the main 
conclusions of the Letter, we feel obliged to retract it. All the authors 
have signed this retraction with the exception of Y.-Y.L., who is deceased. 
Correspondence should be addressed to J.D.B. (jboeke@jhmi.edu). 
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BRAIN MAPPINGIN 
HIGH RESOLUTION 


Tools that make it possible to chart every neuron and its connections are 
helping neuroscientists to realize their dream of whole-brain maps. 


BY VIVIEN MARX 


esearchers wanting to understand the 
Resin of the brain need maps on 
any different scales — to link struc- 

ture to function, to parse the complexities of 
memory loss or learning disability, or to find 
out what is different about the brains of people 
with neurodegenerative diseases. For instance, 
programmes such as the Human Connectome 
Project, funded by the US National Institutes 
of Health (NIH) in Bethesda, Maryland, and 
the Consortium of Neuroimagers for the Non- 
invasive Exploration of Brain Connectivity 


and Tracts (CONNECT), funded by the Euro- 
pean Commission in Brussels, are producing 
magnetic resonance imaging (MRI) maps that 
follow neuronal connections across the entire 
brain, on a scale of tens of centimetres. 

Some research teams are embarking on even 
more ambitious projects to create maps that 
reveal structure and connections at the scale 
of individual neurons. The slender processes 
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of neurons — the axons and dendrites — are 


20 micrometres or less in diameter, but can 
extend for several millimetres. Mapping the 
thicket of 86 billion neurons and their connec- 
tions in the human brain at this scale is still 
a distant dream, but the goal of mapping the 
75 million neurons in the mouse brain could 
bea little closer. And the second might pave 
the way for the first. 

To achieve this, however, researchers need 
new tools, starting with refinements to elec- 
tron microscopy. In conventional electron- 
microscopy approaches, scientists have to 
make ultrathin slices of brain tissue and > 
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> laboriously image them slice by slice to 
build up a three-dimensional (3D) picture. 
In 1986, the complete nervous system of 
one model organism was mapped using this 
approach — the 302 neurons of the nematode 
Caenorhabditis elegans' — but the process is 
too slow and cumbersome to scale up. Now, 
electron microscopes at many universities are 
“being mothballed”, says neurobiologist Jeff 
Lichtman at Harvard University in Cambridge, 
Massachusetts, because they are deemed a 
throwback to an era when “all you could do 
was stain and look”. 

Nevertheless, high-resolution, 3D neuronal 
mapping by electron microscopy is gain- 
ing momentum. “It’s not just the old neuro- 
anatomy gussied up with new machines,” 
says Lichtman, who studies the neuronal 
architecture of the mouse brain. “It’s giving 
us three-dimensional information about 
structure at the super-resolution of nano- 
metres — and that is invaluable.” These capa- 
bilities are spurring the development of new 
ways to prepare, image and analyse brain tis- 
sue from model organisms such as mice, and 
from people who have donated their brains 
to science on their death. 

Until now, when neuroscientists published 
cell-level images and analysis, they were look- 
ing at a tiny region of brain; for example, a few 
hundred neurons in the mouse or fruitfly”’. 
Efforts to map all the neurons and circuits in 
the mouse brain are just beginning in several 
labs. Creating structural maps isa high priority 
of the NIH-funded Brain Research Through 
Advancing Innovative Neurotechnologies 
(BRAIN) Initiative, which US President Barack 
Obama launched in April’. Another large-scale 
effort is the Human Brain Project, funded by 
the European Union, which is focused on a dif- 
ferent kind of map — a computational model 
of the brain. 

Neurobiologist and microscopist Winfried 
Denk at the Max Planck Institute for Medi- 
cal Research in Heidelberg, Germany, thinks 
that the technology to 
image an entire mouse 
brain at cellular reso- 
lution is within grasp. 
Late next year, he and 
Lichtman are each 
scheduled to receive a 
new multi-beam scan- 
ning electron micro- 
scope (SEM) for their 
labs, developed by 
Carl Zeiss Microscopy 
in Oberkochen, 
Germany. Now in 


the prototype stage, Winfried Denk 
this instrument will estimates that 
enable the imaging age capture of 
of brain slices to be awhole mouse 
accelerated hugely — brain could yield 
possibly byas much 60 petabytes of 
as 60 times. Such data. 
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One manufacturer is developing wide diamond knives that can be used 
to shave off samples containing a slice of the whole mouse brain. 


Knives in current 
use are usually 


around 1 mm wide. 


a. 
Cutting edge is only 
4 nanometres wide 
when polished. 


“72 


Sy; 


Wide diamond knife has to be 
made from a perfect crystal. 


Brain tissue embedded in resin 


gains could make it feasible to map all the 
neurons in a mammalian brain. Other micro- 
scope manufacturers, such as FEI in Eind- 
hoven, the Netherlands, are also developing 
tools to enable detailed brain mapping. 


PREPARING TO SEE 

Before brain tissue can be imaged using elec- 
tron microscopy, samples must be prepared 
by slicing and staining. These steps are impor- 
tant for the subsequent process of using the 
two-dimensional images generated by the 
electron microscope to reconstruct the three 
dimensions of the brain and its neuronal con- 
nections. Scientists are exploring new ways to 
prepare samples so that they can obtain higher 
contrast for electron microscopy and view 
larger pieces of brain tissue. 

When brain tissue is well stained with chem- 
icals, thinly sliced and viewed using electron 
microscopy, researchers see slices that look 
like thin plates covered with tiny soap bubbles. 
Some bubbles represent the cross-section of a 
neuron, and the soap-bubble boundary shows 
where one neuron ends and another begins. 
Contrast is crucial, because scientists need to 
discern neurons from a wealth of other cell 
types and organelles in the brain tissue. They 
can then trace each neuron in each of the 
imaged slices, by hand and by eye. 

But such imaging has its problems: stain- 
ing often does not show enough structure, 
the slices might be warped and images can be 
blurry. 

Denk and his team previously developed 
an approach called serial block-face electron 
microscopy (SBEM), which obviates the need 
to prepare slices before imaging them. Instead, 
successive images are obtained by scanning 
the face of an unsliced block of tissue placed 


in the electron microscope, then cutting off an 
ultrathin slice using an automated microtome 
within the instrument. The newly exposed 
surface of the sliced block is rescanned, and so 
on until a stack of images has been obtained. 
The advantage of imaging the unsliced tissue, 
Denk says, is that it does not matter if the slice 
itself crumples. 

Until now, Denk and his team have cut only 
from pieces of tissue that measured much less 
than 1 mm across. But Denk’s ultimate aim is to 
image and cut slices from a whole mouse brain, 
which will mean dealing with tissue blocks that 
are some 10 mm across. Denk is now building 
a whole-brain microtome to incorporate into 
the microscope. This will require a diamond 
knife that is 8-10 mm wide instead of the 
1-mm knife used at present, he says. 

To make these knives, he is collaborating 
with diamond-knife manufacturer Diatome 
in Biel, Switzerland. The longer blade width 
isa challenge, says Diatome engineer Helmut 
Gnaegi. At 10 mm, it is around ten times wider 
than a typical diamond knife, and the blade 
must be made from a large diamond that is 
free of crystalline imperfections (see ‘Sharp 
tools). 

The entire 10-mm cutting edge must be 
polished to perfection, Gnaegi says, because 
even a single knife mark would interfere with 
the smooth sample surface that Denk requires. 
After polishing, the cutting edge must have a 
maximum thickness of only 4 nm. This is 
a “formidable challenge” that the company 
hopes to overcome by the end of this year, says 
Gnaegi. He and his team have designed special 
polishing equipment for the task. 

Gnaegi’s team is also addressing the build- 
up of electrostatic charge that occurs when 
the knife tries to cut through tissue that has 
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been embedded in non-conducting resin. The 
charge tends to make the sections stick to the 
sample surface, which makes imaging difficult. 
“An electrically conductive knife surface with 
optimized gliding properties leads to a sample 
surface free of cutting debris,” says Gnaegi. 

The SBEM approach helps with precise ren- 
dering of tissue in three dimensions, but it is 
hard to make it high-throughput. Researchers 
would have to set up many microscopes ina 
dedicated facility. The alternative is an instru- 
ment that is “intrinsically a parallel imaging 
machine’, Denk says. That is why his lab is so 
interested in the Zeiss multi-beam machine 
that is due next year. 

Lichtman and his team take a different 
approach to sample preparation. He has 
developed an automatic tape-collecting ultra- 
microtome (ATUM), which slices tissue, places 
each slice on a tape and delivers the slices to 
the microscope in a conveyor-belt fashion. 
Still other labs are working on different tissue- 
slicing approaches. “One of them will eventu- 
ally emerge as the way to do things, probably,” 
says Denk. 

Many labs are working on ways to improve 
the conventional tissue-staining methods used 
for electron microscopy, which rely on heavy 
metals such as lead and osmium. For example, 
Shawn Mikula, a postdoc in Denk’s team, has 
been experimenting with immersing whole, 
unsliced mouse brains in various staining 
chemicals for extended periods to enhance the 
spread of osmium throughout the tissue and 
to achieve adequate contrast in the electron 
microscope. This procedure worked well for 
imaging axons, Denk says, but was less suc- 
cessful for imaging the finer processes’. Given 
time, he says, “we're pretty confident that we'll 
figure it out?” 


BEAM BONANZA 

As part of their collaboration with Zeiss, both 
Lichtman and Denk deliver test samples to 
the company, which are imaged using the 
prototype microscope. At present, the num- 
ber of scientist-testers has to be limited, and 
the instrument is not available for pre-order. 


> 


Dirk Zeidler (left) and Gregor Dellemann are 
developing the new Zeiss microscope. 


Looking at images generated by the device 
has been “very exciting, because you realize 
how much faster those images were taken’, says 
Lichtman. When he and his team run experi- 
ments, the lab generates around 1 terabyte of 
image data a day. With the new microscope, 
he says, the rate will be more like 3 terabytes 
per hour. 

Using his tape-collecting microtome with 
the new instrument, Lichtman plans to slice 
a mouse brain into many tens of thousands of 
sections, which can be put into the machine in 
large groups. “It images all of them and then 
you put the next several hundred or thousand 
in,” he says. 

The Zeiss multiple-beam SEM resembles a 
standard SEM in many ways, says Dirk Zeidler, 
the physicist who led its design. It has an elec- 
tron source, lenses and a scanning unit that 
moves the beam across the sample surface 
horizontally, in the same way as a reader's eye 
moves over lines of text. It is built to be com- 
patible with the various ways in which neuro- 
scientists prepare tissue for imaging. 

But the differences from a conventional 
SEM are crucial. The machine has been engi- 
neered for a wide view of the tissue and for 
speed, says Gregor Dellemann, who is the 
business-development manager for the new 
microscope. The most obvious difference is 
that the instrument has 61 electron beams 
scanning across the sample instead of the one 
beam in a conventional SEM, and an array of 
61 secondary electron detectors (see ‘Speed 


110 um 


Zeiss’s prototype electron microscope uses 61 beams in a hexagonal configuration (left). The beams 
simultaneously scan sub-images (shown in yellow and blue squares) that merge into a large image (right). 


150 | NATURE | VOL 503 | 7 NOVEMBER 2013 


© 2013 Macmillan Publishers Limited. All rights reserved 


reading’). It also dispenses with features such 
as energy-dispersive X-ray spectroscopy detec- 
tors or back-scattered electron detectors, both 
of which are used in standard electron micro- 
scopes to map variations in a sample's chemical 
composition. 

Dellemann explains that including several 
detector types would mean incorporating 
three arrays of 61 detectors each, and would 
require a mechanism to guide the right signal 
to each one. Although possible, such a configu- 
ration would add cost and complexity to the 
instrument and slow it down. 

Some scientists had voiced doubts about 
the multiple-beam SEM when they first heard 
about it, says Zeidler, who notes that they even 
questioned whether it might violate the princi- 
ples of optics. But they recanted when shown a 
working instrument as it generated images, he 
says, adding “that was quite fun”. 

The new instrument addresses the speed 
limits and blind spots of a standard SEM. 
When the beam of electrons hits the sample, it 
emits secondary electrons that give informa- 
tion about, for example, the sample’s surface 
shape and staining. Detectors record the inten- 
sities of the secondary electrons and the instru- 
ment analyses these data to give an image of 
the sample. 

To speed up this process, the beam would 
need to move more quickly and detect second- 
ary electrons in less time. But when detection 
times fall below about 50 nanoseconds, the 
images get noisy. Furthermore, in the brief 
moment when electrons hit the detector, “it 
goes blind’, Dellemann says. No data can be 
collected until this ‘dead time’ is over and the 
detector ‘sees’ again. 

Enhancing the abilities of an SEM using 
multiple electron beams and detectors for 
wider and quicker imaging requires atten- 
tion to be paid to basic physics. Image qual- 
ity suffers if the multiple beams are too close 
to each other because the electrons repel one 
another. “It will get blurry, which means you 
do not see the features you want to see,” says 
Zeidler. 

The solution was to spread the charge over 
a larger area, minimizing the forces between 
electrons. More beams cover a larger sample 
area, and more detectors mean that the tem- 
porary blind spots are less of a hindrance. But 
the team still had to optimize the number of 
beams and their spacing, Dellemann says. 
Adding electron beams in a hexagonal con- 
figuration turned out to be the best geometry, 
and 61 beams with 61 detectors turned out to 
be the optimal number. 

Most SEMs detect electrons for intervals of 
around 50 nanoseconds, such that the observer 
can image 20 million pixels per second before 
any physical limits are reached, Dellemann 
says. Ina multi-beam instrument, each beam 
has a dedicated detector in the array and the 
50-nanosecond detection speed stays the same. 
So with 61 beams, the images can get bigger. In 
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SPEED READING 


The prototype multiple-beam electron microscope has 61 beams and detectors 
to give a wide view of the sample and to enhance imaging speed. 
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1 second, Dellemann says, the instrument can 
take images of up to 1.2 billion pixels. 

Zeiss started developing the instrument 
in 2000, when the company began explor- 
ing multi-beam SEM for the semiconductor 
industry in partnership with Applied Materi- 
als of Santa Clara, California. The aim was to 
develop technology that could inspect semi- 
conductors using partially automated, paral- 
lel imaging. A few years later, Zeiss realized 
the potential of this technology for the life 
sciences, with the most interest coming from 
neuroscientists who wanted high-resolution 
brain mapping. 


ENERGY BOOST 

At microscope company FEI, engineers are 
souping up SEMs in a different way, using a 
single electron beam. The prototype of a new 
multi-energy deconvolution SEM is currently 
undergoing testing at the company, says Ben 
Lich, who handles business development for 
the life sciences at FEI. This instrument, too, is 
intended to speed up neuronal mapping. 

Lich and his team have long worked with 
neuroscientists to optimize the standard com- 
ponents of SEMs. The US BRAIN Initiative 
and other ventures are encouraging the com- 
pany to “do a bit more” for neuroscience, Lich 
says, so that researchers in the field can extract 
more information from their samples. 

Current technology has a hard time imaging 
the more slender processes. To see cell bodies 
and axons, tissue sections need to be 30-40 nm 
thick, but to see the fine processes, they can 
be only 10-20 nm thick. Missing thinner 
branches in the imaging increases the risk of 
inaccuracies when reconstructing 3D circuits 
of connected neurons. 

To address this challenge, FEI’s new micro- 
scope performs ‘sub-surface imaging’ It takes 
multiple images of the same area of a sample 
and is able to image below the surface. It can 
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help scientists to track down finer processes 
that they may have missed at a different resolu- 
tion, Lich says. 

The instrument sends sequential electron 
beams into the sample at differing speeds 
— starting at 1 kilovolt and ramping up to 
around 2.5 kV. This way, it obtains more infor- 
mation from the sample than a conventional 
single-scan microscope can, Lich says. After 
it acquires images, the instrument applies a 
mathematical technique called deconvolu- 
tion, which separates the information from the 
different depths in the sample and produces 
virtual sections. 

The instrument lets researchers scan their 
samples with varying resolution. A first image 
at low magnification might reveal cell bodies, 
which are important for neurobiology but not 
for understanding neuronal connectivity. An 
area that is densely forested with finer pro- 
cesses might merit a high-resolution scan. 

Microscope manufacturers say that their 
experience in other fields helps with the chal- 
lenge of whole-brain 
mapping, which will 
involve imaging many 
samples for months at 
atime. FEI, for exam- 
ple, has microscope 
customers who are 
checking the quality 
of small structures on 
semiconductors or 
determining the asso- 
ciation of minerals 
and precious metals 
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in mining ores. Their Brain mapping is 
instruments run "oft just theold 
24 hours a day, seven neuroanatomy 
days a week for many gussied up with 
months, says Lich, mew machines”, 
and they hold up. says Jeff 
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of microscopes for neuroscience in particular 
also means helping to achieve better contrast 
from all samples, including those that might 
not be well stained. Lich says that FEI has 
designed its electron-beam column to opti- 
mize detection sensitivity. 

In juggling such factors as data quality, 
scanning speed and robustness, FEI’s team 
designed the multi-energy SEM so that its 
detectors for back-scattered electrons are engi- 
neered for speed, particularly at low energies. 
Using back-scatter signals is a more stable way 
to create images, Lich says. Secondary electron 
detectors are more prone to showing sample- 
charging problems, which is when the interac- 
tion between the electron beam and the sample 
leads to distorted images. In areas where the 
sample is charged, the acquired data can lead 
to ambiguous interpretations of neuronal con- 
nectivity. Back-scatter images are not as sensi- 
tive to this problem, he says. 

For neuron mappers, FEI’s method for SEM 
imaging offers a way to slice a sample virtu- 
ally before physically cutting it, Lich says. The 
microscope’s electron beam hits an area mul- 
tiple times, generating a virtual image stack. 
Rather than needing to physically slice tissue 
to much less than 40 nm thickness, the sample 
can be imaged ten times at 4-nm resolution to 
create a virtual slice of the desired width. The 
technique could reduce the need for ultrathin 
slicing and the consequent risk of sample 
damage and distortion. Physical slices could 
remain relatively thicker, and intermediate 
layers in the slice would still be resolved, says 
Lich. And for a scientist using Denk’s approach 
of imaging followed by slicing, the tissue can 
be virtually imaged several times to different 
depths before it is physically shaved off. 

The new SEM technology quickly creates 
issues of scalability for those who want to 
image large brain sections, Lich says. If scien- 
tists want data at 5-nm resolution, every voxel 
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black rings) and organelles such as mitochondria. Careful analysis reveals where neurons end and begin. 


—a3D pixel — must be 5 nm on each side. 
At this resolution, the FEI microscope can 
obtain images, but imaging a cube of brain tis- 
sue measuring 1 mmon each side would yield 
8 petavoxels of data — and those could take 
years to analyse, says Lich. 

In his view, the neuroscience community 
is pushing microscope development with its 
structural mapping projects. Some, he says, 
might call the plan to image the whole mouse 
brain outrageous. “In principle, nothing is 
impossible — but it is really an extraordinary 
challenge,” he says. 


DATA ANALYSIS 

As neuroscience ramps up its mapping efforts, 
the challenge of big data becomes more appar- 
ent. Mapping the human brain is as difficult 
as creating a 3D map ofa city 10,000 times 
the size of Tokyo, London or New York, and 
locating every inhabitant in every building, 
street, stairwell, lift and subway. To establish 
the same connectivity that neuroscientists are 
hoping to achieve, the mapping would then 
have to unearth all lines of communication — 
personal interactions as well as those by 
phone, post and e-mail — between all inhab- 
itants of these 10,000 cities, explains Moritz 
Helmstaedter at the Max Planck Institute of 
Neurobiology in Munich, Germany. 

Denk estimates that capturing images of a 
whole mouse brain with the new Zeiss SEM 
could deliver 60 petabytes of data. And Licht- 
man notes that such large amounts of data are 
going to be even more challenging to share 
than they are now. At present, he sends his 
microscopy data sets to the Open Connectome 
project at Johns Hopkins University in Balti- 
more, Maryland. The project’s website is an 
open repository for data accrued from differ- 
ent labs working on the brains of model organ- 
isms such as mice and nematodes. It enables 


anyone to download information from MRI, 
electron-microscopy and brain-wiring studies. 

Lichtman has one data set that is around 
100 terabytes in size. “How do you give that 
to somebody?” he asks. As data sets get even 
larger, the community will have to explore 
ways to make data available to scientists for 
perusal without download, he says. 

The next-generation microscopes can 
deliver the high-resolution images, but new 
approaches, including computational tools, 
are needed to build 3D wiring diagrams of 
neuronal circuits based on those images. 

Denk and colleagues in the United States 
and Germany have recently used SBEM to 
map a small section of mouse retina’. Even 
with computer software, tracing the neuronal 
pathways in the images would have taken 
vastly more time and more people than are 
available in a typical lab. The team turned to 
crowd-sourcing, enlisting the help of a host of 


Fluorescence microscopy can show many types of 
mouse neuron using the ‘Brainbow’ technique. 
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non-scientists sitting at their home computers. 

Software to help researchers with these 
tasks is being developed, and computer sci- 
entists plan to keep pace with the advances 
in microscope technology, Lichtman says. 
Image-analysis software under development 
includes tools with built-in machine learning 
and other types of algorithm to automate or 
semi-automate the tasks. 

Denk says that he used to think that 
researchers had to solve data-analysis prob- 
lems before imaging larger brain volumes and 
generating huge data sets. But his view has 
changed. “My current thinking is that we'll just 
produce a large volume,” Denk says. “Then, 
that will embarrass the analysis people into 
coming up with something.” 

Neuroscientist Sebastian Seung at the Massa- 
chusetts Institute of Technology in Cambridge, 
who collaborated on the mouse retina mapping 
study’, leads a group that launched the online 
game EyeWire. More than 80,000 volunteers 
worldwide have helped to trace the complete 
3D contours of neurons running through 
the game's electron-microscopy images. He 
is undaunted by the large data sets that brain 
mappers will be sending his way for analysis. 
“Why would we be frightened?” he says. “It is 
just an opportunity.’ Size in image data sets is 
a plus when imaging the mammalian nervous 
system. Imaging too small a section risks miss- 
ing neuronal connections, he says. 

Still, the challenge in mammals is formi- 
dable. When scientists image fruitfly brain 
tissue, a small imaging volume can capture 
much of a neuronal circuit, because many 
neurons do not run and branch over long 
distances, says Seung. In mammals, many 
researchers trace connections found in the 
retina, because much of the circuit can be 
captured with a small tissue volume, he says. 
But that is already a huge challenge — and it 
gets worse when tracing connections in the 
brain, because neuronal circuits are spread out 
over much larger volumes in mammals than 
in flies. “You can't really map out connections 
and circuits unless the volume you are imag- 
ing is reasonably large,” he says. 

Lichtman believes that the neurobiology 
community will ultimately find a way to gar- 
ner the required knowledge because research- 
ers stand to learn so much about the brain. 
“Without it, there are lots of mysteries,” he says. 
“With it, mysteries become just facts.” m 


Vivien Marx is technology editor for Nature 
and Nature Methods. 
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The neuronal pathways in the brain’s white matter are picked out in this magnetic resonance image. 


A head start for 
brain imaging 


With several big data projects under way, there is growing 
demand for neuroimaging expertise. 


BY VIRGINIA GEWIN 


( orey White felt pretty fortunate during 
his job search late last year. Over the 
course of 4 months, he found at least 

25 posts to apply for — even after he had fil- 

tered the possibilities to places where his wife 

also had job prospects. Competition for the 
jobs was, as he expected, fierce, but he secured 
three interviews. In the end, he says, it was his 
skills in functional magnetic resonance imag- 
ing (fMRI) that helped him to clinch a post at 

Syracuse University in New York, where they 

were eager to elevate their neuroscience profile. 
The human brain is something of an 

enigma. Much is known about its physical 


structure, but quite how it manages to marshal its 
myriad components into a powerhouse capable 
of performing so many different tasks remains a 
mystery. Neuroimaging offers one way to help 
find out, and universities and government ini- 
tiatives are betting on it. Already, an increasing 
number of universities across the United States 
and Europe are buying scanners dedicated to 
neuroimaging — a clear signal that the area is 
set for growth. “Institutions feel an imperative 
to develop an imaging programme because 
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everybody's got to have one to be competitive,” 
says Mark Cohen, an imaging pioneer at the 
Semel Institute for Neuroscience and Human 
Behavior at the University of California, Los 
Angeles. 

At the same time, a slew of major projects 
focusing on various aspects of the brain is 
seeking to paint the most comprehensive pic- 
ture yet of the organ’s organizing principles — 
from genes to high-level cognition. Asa result, 
young scientists with computational expertise, 
a fluency in multiple imaging techniques and a 
willingness to engage in interdisciplinary col- 
laborations could readily carve out a career in 
this dynamic landscape. 


INSIDE THE MIND 

Several imaging techniques are experiencing 
fresh demand. The development of tracers that 
bind to the B-amyloid proteins that form the 
plaques associated with Alzheimer’s disease 
has reinvigorated the use of positron emis- 
sion tomography (PET), for example. PET 
scans can now be used to detect the earliest 
stages of Alzheimer’s disease, which could aid 
the development of effective treatments. And 
fMRI, which detects changes in blood flow, is 
now being used to identify neural networks 
involved in cognitive function, to recognize 
and diagnose neurodegenerative diseases and 
to assess treatments or therapies (see Nature 
484, 24-26; 2012). Researchers have also com- 
bined fMRI skills with novel optogenetic tech- 
niques, which use genetic controls to activate 
specific neurons in order to track neuronal 
activity in live animals. 

These imaging techniques are being pulled 
together in a number of big projects that aim 
to integrate findings on the molecular, cel- 
lular and whole-brain scales. In January, the 
European Commission announced that it 
would fund the 10-year Human Brain Pro- 
ject at roughly €1 billion (US$1.4 billion); the 
project has the ambitious aim of producing 
realistic computer simulations of the brain, 
all the way from individual cells to the entire 
organ (see Nature http://doi.org/phz; 2013). 
Some 250 scientists from 80 groups are 
involved so far, says Richard Frackowiak of the 
University of Lausanne in Switzerland, who is 
one of the project’s co-leaders. He expects that 
number to grow in future funding rounds. 

In the United States, the US$110-million 
Brain Research through Advancing Innova- 
tive Neurotechnologies (BRAIN) Initiative, 
announced in April, will map the brain's activ- 
ity in exquisite detail, with imaging likely > 
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> tobeamong the funding 
priorities (see page 26). 

Technological advances 
in neuroimaging are the 
foundation of other big, 
interdisciplinary collabo- 
rative efforts, such as the 
Human Connectome 
Project, funded by the 
US National Institutes 
of Health (NIH), which is 
mapping the brain’s structural 
and functional neural connec- 
tions; and the European Com- 
mission's IMAGEN project, which is 
scanning the brains of 2,000 teenagers — 
at the age of 14 and again at 18 — to seek links 
between brain structure, risk-taking behaviour 
and mental health. “Even though studying 
diseases is a big driver in neuroscience, we've 
put increasing effort into understanding how 
healthy brains work,’ says IMAGEN scientist 
Christian Biichel, director of the neuroimag- 
ing centre at the University Medical Center 
Hamburg-Eppendorf in Germany. 

Although these big projects offer significant 
career opportunities, there is a potential down- 
side that researchers need to keep in mind, says 
Ray Dolan, director of the Wellcome Trust 
Centre for Neuroimaging at University College 
London. Search committees typically place a 
premium on independence and individual 
initiative. “I worry about top-down initiatives 
creating an army of workers who, when the 
projects are done, are no longer attractive as 
potential hires by universities because they 
haven't developed the ability to generate their 
own ideas,’ he says. Even so, Dolan believes 
that neuroimaging skills allow researchers to 
cross disciplinary boundaries relatively easily. 


BEYOND NEUROSCIENCE 

The university neuroimaging boom is not con- 
fined to neuroscience departments. There is 
a real hunger for ways to quantify brain and 
behaviour in psychology and psychiatry, says 
Graeme Mason, director of the neuroimaging- 
sciences training programme at Yale School of 
Medicine in New Haven, Connecticut. 

Most of the faculty positions in estab- 
lished disciplines, such as cognitive psy- 
chology, require neuroimaging skills. And 
neuroimaging has fuelled the rapid growth 
of subfields such as neuroeconomics, which 
looks at human decision-making, and social 
neuroscience, which studies disorders and 
the development of human social behaviour. 
Todd Heatherton, a cognitive neuroscientist at 
Dartmouth College in Hanover, New Hamp- 
shire, and a frequent job-search chair, says that, 
until recently, social neuroscience had more 
jobs available than there were truly qualified 
applicants — those with both the computa- 
tional skills and an appreciation of social psy- 
chology. Competition for the top applicants 
remains fierce, he says. 


Neuroimages explore real-time 
views into cerebral function. 


For many young neu- 
roscientists, the biggest 
concern is the lack-lustre 
state of federal funding in 
some countries, especially 
given that scanning pro- 
grammes are costly to get 
up and running. At Syra- 

cuse University, White is 
one of the few people doing 
fMRI studies. Given that the 
facilities he uses are not solely 
dedicated to research, he fears 
that he would struggle to meet the 
criteria for an NIH grant to fund his 
work on cognitive behaviour. So he is trying to 
bolster his chances by setting up cross-discipli- 
nary collaborations that take advantage of local 
expertise in genetics, cardiovascular health or 
exercise science. Already he has been talking to 
an exercise scientist about relating the metrics 
of artery stiffness to cognitive processing. 

Itis not a bad strategy. Carolyn Asbury, a con- 
sultant for the Dana Foundation, a non-profit 
organization in New York city that is dedicated 
to neuroscience, says that funding applications 
have become increasingly interdisciplinary 
and collaborative during the past decade, often 
combining different imaging techniques. The 
foundation allocates at least half of its annual 
funds of some $2.8 million to support budding 
neuroscientists starting careers. It is particularly 
interested in proposals involving cellular and 
molecular imaging techniques, such as devel- 
oping new tracers for use in PET. 

Funders are also increasingly interested in 
applied clinical uses of neuroimaging tech- 
niques, which open yet more avenues for 
collaboration. For example, the European 
Commission in September put €4.2 million 
into a €6-million project called METSY. This 
aims to identify neuroimaging biomarkers of 
psychosis and to study the links between psy- 
chotic disorders, such as schizophrenia, and 
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Corey White hopes to build interdisciplinary collaborations that capitalize on his fMRI skills. 


other conditions, such as obesity and diabetes. 

In Switzerland, the Translational Neuro- 
modeling Unit at the University of Zurich has 
brought clinicians, neuroimagers and compu- 
tational analysts together under one roof to 
develop diagnostic tools based on models to 
offer treatment recommendations and predict 
clinical outcomes for psychiatric patients. And 
in September the Kessler Foundation in West 
Orange, New Jersey, opened the first inde- 
pendent neuroimaging-focused rehabilita- 
tion institute in the United States, where it will 
conduct research and clinical trials aimed at 
bolstering recovery from brain trauma. 

Companies also see the potential for imag- 
ing to become a staple of health care and drug 
development. Bin He, director of the Center 
for Neuroengineering at the University of Min- 
nesota in Minneapolis, says that his graduate 
students sometimes end up working at major 
imaging companies. Philips recruited one of 
his students last year, he says, and compa- 
nies such as Siemens, General Electric and 
Medtronic are also looking for neuroimaging 
talent. Russell Schramm, head of talent acqui- 
sition for Philips North America, says that 
imaging systems will be a key area of focus for 
the company’s health-care business over the 
next five years. Projects such as the Alzheimer’s 
Disease Neuroimaging Initiative — an effort 
funded in part by the Alzheimer’s Association 
to study changes in cognition, function and 
brain structure in elderly people — could have 
an impact in the clinic and spark interest from 
industry, he says. 


SURVIVAL SKILLS 

To capitalize on the opportunities for neuro- 
imaging, it is important to gain the appropriate 
statistical and computational skills. This means 
taking advanced statistics courses and learning 
a programming language. 

“Neuroimaging is now seen as a quantita- 
tive field — built on real data, not clever par- 
lour tricks — that has an impact in a variety 
of areas of medicine,” says Cohen. Computa- 
tional skills, including multimodal analyses, 
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signal processing and parametric statistical 
modelling, are key to successful analyses. “It 
takes quite a bit of skill to look at data and 
identify what is artefact versus what is real 
and important,’ Cohen says. 

Russell Poldrack, director of the Imaging 
Research Center at the University of Texas 
at Austin, doesn’t even bother to interview 
applicants who can't write computer code. 
Data sets are getting so big that trying to 
analyse them without using automated 
methods simply takes too long, he says. 
Off-the-shelf software packages tend not to 
lend themselves to the most interesting and 
novel analysis methods, Poldrack notes, so 
he wants people who can write programs 
that can do those analyses. 

Computational skills have become so 
important that, in July, Dartmouth Col- 
lege hired Alireza Soltani, a computational 
neuroscientist with a background in theo- 
retical physics, to model other people's data 
rather than collect his own. Soltani uses 
detailed computational modelling at the 
synaptic, cellular and network levels to look 
for mechanisms that help to explain behav- 
iour and cognition. This hiring strategy is 
necessary to ensure that the college has the 
people “with the skills to advance the field 
beyond pretty pictures’, says Heatherton. 

As universities increase their neuro- 
imaging capacity, they are facing a recruit- 
ment challenge. “The applicant pool for 
neuroimaging posts is diversified and of 
variable quality,” says Kamil Ugurbil, direc- 
tor of the Center for Magnetic Resonance 
Research at the University of Minnesota. 
It can be difficult to find good people with 
rigorous computational skills in neuro- 
imaging, he notes. At the same time, physi- 
cists, engineers and statisticians can have 
quantitative skills but lack the psychology 
or life-sciences background for a neuro- 
imaging-related post. 

Even those who can offer the right mix 
of skills should consider gaining expertise 
in more than one imaging technique. “We 
all need to specialize to some extent, but 
it’s more advantageous to be well versed 
in a range of complementary techniques 
to avoid being labelled as just the PET guy 
or just the {MRI guy,” says Brian Bacskai, 
a neuroscientist at Massachusetts General 
Hospital in Boston, whose work combines 
cellular and molecular imaging techniques 
to study Alzheimer’s disease. 

Bacskai reiterates the importance of 
collaboration for neuroimaging. “Cross- 
disciplinary studies are the only way to 
move the field forward,’ he says. “I think 
institutions will look at how good appli- 
cants are at working with others as a key 
factor in a CV or tenure package.” m 


Virginia Gewin is a freelance writer based 
in Portland, Oregon. 
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Postdoc’s torch song 


A love affair with research can be just as heartbreaking 
as romantic love, says Christopher Schmitt. 


aybe it was never meant to be. 
Such was my resigned perspective 
when I was dumped by a boyfriend 


while conducting my doctoral field research. I 
had been studying the behaviour of wild spider 
monkeys and woolly monkeys in the Amazon 
for six months, and as the data had been slowly 
coming in, my relationship had been slowly 
deteriorating. Communication had devolved 
from lengthy nightly video chats — me provid- 
ing virtual tours of the field station and images 
of monkeys, him displaying Manhattan sky- 
scapes — toa slow fade of curt, non-committal 
missives that bred a terrible uncertainty. When 
I saw him smiling with an unfamiliar man on 
my social-network newsfeed, I knew which 
way the wind was blowing. 

But it was OK. I was getting my data. I was 
working towards a doctorate and an academic 
career, something bigger than that relation- 
ship. And I knew that once I recovered from 
being dumped by instant message, I could find 
another man — a better man! — who would be 
more supportive of my increasingly demand- 
ing relationship with my work. My research 
seemed to offer the affirmation and validation 
that my personal relationships had stopped 
providing. After all, I thought, it could never 
cut me off the way my now ex-boyfriend had. 

I did get my doctoral degree. I got grants. I 
published. I landed a promising postdoc post 
across the country from my graduate school; I 
could check out a new dating pool while leaning 
on my more gratifying love affair with science. 

Now I am three years into a postdoc that 
has been not quite as productive as promised. 
The wait for overdue data from collaborators 
has made me miss my funding deadlines. 
When I do pitch my research to funders and 


foundations, they send me curt summary 
statements wondering about my lack of post- 
doctoral publications. Letters to tenure-track 
university job-search committees come back 
with polite but deflating responses. 

It’s the slow fade all over again. 

Of course, my challenges are not uncom- 
mon. Postdocs in the current market, espe- 
cially in the life sciences, have little chance 
of landing the tenure-track posts they had in 
mind when they started out. The dire situa- 
tion in the US job market was exacerbated by 
a massive increase in trainees in the early 2000s 
with no increase in faculty positions, and may 
well be amplified by sequestration, that pack- 
age of deep cuts to US federal agencies that 
took effect in March and has no end in sight. 

So it seems that the most important rela- 
tionship in my life, the one I have put above 
all others — my love for academic research 
— is falling apart, owing in no small measure 
to circumstances beyond my control. As this 
happens, I think about all the time I have spent 
away from family and friends, and all the rela- 
tionships that have fallen by the wayside as I 
moved from state to state to build my career. 

Academia has been a difficult love. It seems 
to have made some promises it did not intend to 
keep, and no amount of torch-song singing will 
make the potential loss feel any less dire. I need 
to remember the refrain from Joni Mitchell’s A 
Case of You: having drunk my fill of academia, 
Ican still be on my feet. If my career comes to 
an end now, maybe I can keep my heart intact, 
and move on. 

But until it comes to that, I will hold on to this 
love. Maybe, despite the chilly job climate and 
data delays, my career can still flourish. Why 
else would I have given up so much? I continue 
to submit papers and grant applications. ’m 
looking for postdoc posts or teaching positions 
that will keep me in the game until the tenure 
track comes courting. 'm throwing away my 
misgivings, refusing to become resigned to a 
life outside academia, and hoping beyond hope 
that this love is the one that will not leave me. 

And if it does? 

Well, I'll defer again to Joni: “Ifyou want me 
Tllbe in the bar...” = 


Christopher Schmitt is a postdoctoral 
research fellow in the Center for 
Neurobehavioral Genetics at the University of 
California, Los Angeles. 
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IMMEASURABLE 


BY H.E. ROULO 


geology class, rolling up their flexible 

displays and shoving them into back- 
packs. Yvonne lingered outside, dreading 
what came next: she needed to press the 
button. 

The funeral home had just sent her 
a microfluidic hot key that would take 
action with one nail press. The new but- 
ton rose like a blister on the band of the 
flesh-coloured Metric that clung to her 
wrist. Her racing heartbeat and clammy 
skin revealed her anxiety, casting her Meta- 
score down with each moment she delayed, 
but her finger refused to tap — even if it 
meant losing points. 

Peggy careered around the corner of the 
building, blonde hair brushing the wolf 
on her airbrushed T-shirt. She grinned at 
Yvonne. “Howd you do today?” 

Yvonne glanced at her Metric as if she 
hadn't been staring at it. “I scored basic 
learning and self-actualization points. Just 
another weekday in class.” 

Peggy frowned. “No class participation 
bonus? That's not like you. Hey, let’s walk 
the long way back to the dorms. I’m short 
on exercise.” 

Across campus, a student in a college 
sweatshirt threw a Frisbee to a girl ina tank 
top. Other students sat beneath a tree, lis- 
tening to a guitar. Theyd all be filling their 
socializing bars, and it looked like fun. On 
another day, in another mood, Yvonne 
might have joined them. 

Peggy’s touch startled her. “Hello? You're 
barely here.” 

“Brian —; Yvonne choked on her brother's 
name, and watched the Frisbee fly into the 
bushes. “His ashes are ready to be dispersed.” 

The girl in the tank top fished in the 
undergrowth for the Frisbee and stood up 
with a glad cry, a wrapper flapping in her 
hand. Her partner groaned, obviously wish- 
ing he'd found it. The girl dropped the litter 
into a bin, Metric dinging. 

Peggy pouted. “Damn, the college 
announced a green initiative, and bonus goals 
like community improvement. She probably 
got 500 points. I’m behind for the day. I didn't 
get up with my alarm and I skipped breakfast. 
I chose pizza Friday night as my reward — 
doesnt look like ’'m going to get it?” 

“On the other hand, you don't have to deal 
with your dead brother’s remains. Bonus, 
right?” Yvonne pinched her lips shut. Since 


See streamed out of Yvonne's 


Off the scale. 


Brian’s accident shed learned she shouldn't 
resent how the world went on as if nothing 
had changed. For them, it hadn't. She scuffed 
the ground with her foot. 

Peggy shrugged. 

Theyd circled round on the asphalt path 
and the dorms stood a block away. Yvonne 
wished she was already there, tucked under 
the blankets. She'd lose points, of course. 
Since arriving at school she'd focused her 
Metric on rewarding education and new 
experiences. Peggy used hers to avoid her 
freshman fifteen. But their concentration on 
scoring drew them together. Nothing felt as 
good as the chime for a new high score. 

“Doesnt your Mum want to handle his 
ashes? That’s a major emotional milestone. 
It’s a level-up for sure!” 

“Mum wouldnt use the points. She relies 
ona cell phone. I tried to set a Metric up for 
her, but she never got into it. She says her 
MMO isa better game.” 

“Old-fashioned! They’re such a different 
generation, escaping into TV, video games 
and movies. Hanging out with friends 
online. Real-world accomplishments are 
so much more fulfilling, I mean, as long as 
you've set your incentives right. 

“Tve hit my exercise goal. Let's cross the 
street, itll be shorter.” 

Cyclists rode by, their Metrics ticking 

up health and social 
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Id do it. I don’t know why I'm making a fuss. 
It’s just a button.” Yvonne raised her wrist. 
Why delay? There were no extra points for 
doing it in person. Stomach clenching, 
her index finger hovered but would not 
descend. Although she'd said goodbye at 
the memorial — held at a virtual church 

so friends from all over could mourn 

together — this would be final. Real. 

Peggy halted. “I don’t know what it’s 
like to lose a family member, but I know 
it’s tough. You've been strong, and you're 
helping your family by taking this on” 

Tears blurred the bottom of Yvonne's 
vision. She embraced Peggy. 

Peggy squeezed back. She sniffed 
and wiped a hand across her nose. Her 
Metric dinged and her eyes brightened. 

“Social connection! I was totally there for 
a friend. Look, I’m ahead for the day.’ She 
gave Yvonne's unresisting shoulders another 
squeeze. “And usually I score low on empa- 
thy. See you later. Thanks for making it a 
great day!” 

Peggy bounded towards her dorm, 
humming. 

Yvonne's fingers stroked the bubbled key 
of her Metric. One press, and she'd com- 
plete a mega-life task. Shed level-up. Some 
people were born rich, and exotic trips set 
their experiences scores ahead. But with 
this, shed be ahead of most of her peers. She 
could gain something nice from a painful 
life event. Brian would like that. 

But she didn't want to score points for this. 

“Peggy!” she called. 

The girl stopped on the steps. 

Yvonne slipped the Metric from her wrist. 
The thin coil dangled between her fingers, 
innocent of the weight she'd given it. She 
tossed the Metric into the bushes and heard 
a sad chime — lost points. 

“OMG, what are you doing?” Peggy yelled. 

Vertigo flooded Yvonne and her knees 
threatened to buckle as she tried to orien- 
tate herself towards the mortuary without 
her Metric to guide her. Once familiar build- 
ings hid the way, their configuration con- 
fusing to a sleepwalker who'd awakened. On 
the second step towards her goal, her stride 
strengthened. Over her shoulder she called: 
“There's litter in the bushes if you want it!” 

Peggy squealed. A Metric chimed. = 


H. E. Roulo is a Pacific Northwest author 
whose stories have appeared in a dozen 
podcasts, magazines and anthologies 
(www.heroulo.com). 
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